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TTCAGGAATT GATTGTAAGT TCTGTTATCA 
CTGATATGCT CTAGAACAAC ATTGCTAAGG 
TTCACTGGTG TCGGTGAGCG TACCCGTGAG 
AATGCAAGAG ACCCGTGTCA TTCAACTCGA 
5 TTGTCTTCGG TCAAATGAAC GAGCCCCCTG 
CTTACCGGTT TGACCATTGC TGAATACTTC 
CGGTAGGCTT CATGCTTCTA TCGCTAGGGG 
TCGCTTTTCT AGTGCTTCTC TTTATTGACA 
GCTGGTTCTG AAGTGTC TGC CTTGCTCGGT 
10 TTACCAACCT ACTCTCGCCG TCGACATGGG 
CCACCACCAC CAAGGGATCC ATTAC TTC AG 



ACTAAAGCCG ACAGCGGTTG 50 

CTCACGGTGG TTACTCCGTG 100 

GGTAACGATT TGTACCATGA 150 

CGGAGAGTCC AAGGTCGCTC 2 00 

GTGCCCGTGC CCGTGTTGCC 2 50 

CGTGACGAGG AAGGCCAAGA 300 

CGTGTGATAC AGGAGGCTAA 3 50 

ACATTTTCCG TTTCACTCAA 400 

CGTATTCCTT CCGCTGTCGG 450 

TGTTATGCAG GAACGTATCA 5 00 

TGC A 534 



15 2 ) INFORMATION FOR SEQ ID NO: 480 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 494 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 { vi ) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus albidus 

(B) STRAIN: ATCC 66030 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 480 



CGTCTTGATT -CAAGAATTGA -TCAACAACAT CGGGAAGGCC CACGGTGGTT 50 

ACTCGGTCTT CACCGGTGTC GGTGAGCGAA CTCGAGAGGG TAACGATCTG 100 

TACCACGAAG TGAGTTGCGC CGTCCGAGTG TTTCCCGGGG AATCGCAAGA 150 

CTGATGTTGT CCCTTCTTCT CAGATGAGGG AAACCGGTGT CATCAACCTC 2 00 

35 GAGGGTGACT CCAAGGTCGC CTTGGTCTTC GGTCAGATGA ACGAGCCCCC 25 0 

TGGAGCC CGA GCCCGAGTCG CCTTGACCGG TTTGACCATT GCCGAATACT 3 00 

TCCGAGACGA GGAGGGTCAG GATGTCTTGT TGTTCATTGA CAACATTTTC 3 50 

CGATTCACCC AAGCCGGTTC CG AAGTGTC C GCCTTGTTGG GTCGTATCCC 400 

CTCCGCCGTC GGTTACCAGC CCACTCTGTC CACCGACATG GGTACCATGC 45 0 

40 AGGAGCGAAT TACCACCACC AAGAAGGGTT CCATCACTTC CGTC 494 



45 



2 ) INFORMATION FOR SEQ ID NO: 481 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 415 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Fusarium oxysporum 

(B) STRAIN: WSA-212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 481 

60 TTCAGGAGCT TATCAACAAC ATCGCCAAGG CCCACGGTGG TTACTCCGTC 5 0 

498 
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10 



25 



55 



TTCACTGGTG TCGGTGAGCG AACTCGTGAG GGTAACGATC TGT AC C AC G A 10 0 

AATGCAGGAG ACTTCCGTCA TTCAGCTTGA TGGCGAGTCC AAGGTCGCCC 15 0 

TGGTCTTCGG TCAGATGAAC GAGCCCCCTG GAGCTCGTGC CCGTGTCGCC 2 00 

CTTACCGGTC TTACTGTAGC TGAATACTTC AGAGATGAGG AGGGTCAGGA 25 0 

CGTGCTGCTC TTCATTGACA ACATTTTCCG ATTCACTCAG GCCGGTTCCG 3 00 

AGGTGTCTGC CCTTCTCGGT CGTATCCCCT CTGCCGTCGG TTACCAGCCC 3 50 

ACCCTCGCCG TCGACATGGG TGGTATGCAA GAGCGTATTA CCACCACCAC 400 

CAAGGGTTCC ATTAC 415 



2 ) INFORMATION FOR SEQ ID NO: 482 



( i ) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: 1281 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Geo tri chum spp, 

(B) STRAIN: Lev- 4 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482 



AGGACAACCT CCCCGCTATT CTTAACGCTC TTGAGCTTAA GAGAGATAAC 50 

GGTGAGAAGC TCGTTCTCGA GGTTGCCCAG CATCTGGGTG AGAAC AC TGT 100 

30 C C GT ACT ATT GCTATGGACG GTAC TGAGGG TCTCGTCCGT GGTCAGCCCG 15 0 

TTGTTGACAC CGGTGCCCCC ATTACCATTC CCGTTGGTCG TGGTACTCTT 2 00 

GGTAGAATTA TCAACGTCAT TGGTGAGCCC ATCGATGAGC GTGGACCCAT 250 

TGAGGCTACC AAGTACCTCC CCATCCACAC CGAGCCCCCC ACCTTCGCTG 300 

AGCAGTCTAC CTCCGCTGAG GTTC TTG AG A CTGGTATCAA GGTTGTCGAT 350 

35 CTCCTTGCCC CCTACGCCCG TGGTGGTAAG ATTGGTCTCT TCGGTGGTGC 4 00 

CGGTGTCGGT AAGACCGTTT TCATTCAGGA GC TG ATT AAC AACATTGCCA 450 

AGGCCCATGG TGGTTTCTCC GTTTTCACCG GTGTCGGTGA GAGAACCCGT 500 

GAGGGTAACG ATTTGTACCG TGAGATGAAG GAGACCGGTG TCATCAACCT 550 

CGAGGGTGAG TCTAAGGTCG CTCTCGTTTT CGGTCAGATG AACGAGCCCC 600 

40 CTGGAGCCCG TGCCCGTGTT GCTCTTACTG GTCTTACCAT TGCTGAGTAC 650 

TTCCGTGATG AGGAGGGTCA GGATGTGTTG CTCTTCGTTG ACAACATTTT 7 00 

CCGTTTCACT CAGGCCGGTT CCGAGGTGTC TGCCCTTTTG GGTCGTATTC 750 

CCTCCGCTGT CGGTTAC CAG CCCACCCTTG CCACTGATAT GGGTGCCCTG 800 

CAAGAGCGTA TTACCACCAC CCAGAAGGGT TCCGTCACTT CCGTCCAGGC 85 0 

45 CGTCTACGTC CCTGCCGATG ATTTGACCGA TCCTGCCCCT GCTACCACCT 900 

TCGCCCATTT GGATGCCACC ACCGTCTTGT CGCGTTCCAT TTCTGAGTTG 950 

GGTATCTACC CCGCTGTCGA TCCCCTTGAT TCCAAGTCTC GTCTTTTGGA 1000 

TATCACCGTC GTTGGCCAGG AGCACTACGA TGTTGCTACC CAGGTCCAGC 1050 

AGACCCTCCA GTCCTACAAG TCTCTTCAGG ATATCATTGC CATTTTGGGT 1100 

50 ATGGATGAGT TGTCTGAGGC TGATAAGCTT ACTGTCGAGC GTGCCCGTAA 1150 
GATCCAGAGA TTCCTTTCCC AGCCCTTCAC TGTCGCTGAG GTTTTCACTG • 1200 

GTATCGAGGG CCGTCTCGTT CCTTTGAAGG ACACTGTTCG CTCTTTCAAG 1250 

GAGATCCTTG AGGGCAAGTA CGACCACCTC C 1281 



2 ) INFORMATION FOR SEQ ID NO : 483 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 586 bases 

499 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



10 



35 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulation 

(B) STRAIN: G185A5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483 



ATTCAAGAAT TGATCGTACG TTCCTCCGCC CCACACACGA TCAATGGAGA 50 

AAGAAACAAA TTTTTTGGGA GTGGTCATTT TTTC TAATAA TTCGAATAGA 100 

15 ACAACATTGC CAAAGCCCAC GGTGGTTACT CCGTGTTCAC TGGTGTCGGC 150 

GAGCGGACCC GTGAAGGAAA TGACTTGTAC CACGAAATGC AGGAAACCCG 200 

TGTTATCCAG CTCGATGGAG AGTC CAAGGT CGCCCTCGTT TTCGGTCAGA 25 0 

TGAACGAGCC TCCCGGAGCC CGTGCCCGTG TTGCCCTCAC TGGTCTGACC 3 00 

GTTGCCGAGT ACTTCCGTGA CGAGGAAGGC CAAGACGGTA TGTATAAGTA 35 0 

2 0 TACACCGTAG CAAATCAACA CAGAGCTTCA CTCACGCTCG GATTTAGTGC 400 

TTCTCTTCAT C G AC AAC ATT TTCCGCTTCA CTCAGGCCGG TTCCGAAGTG 45 0 

TCTGCCCTGC TCGGCCGTAT TCCCTCCGCC GTCGGTTACC AACCCACCCT 500 

CGCCGTGGAC ATGGGTGGTA TGCAGGAACG TATCACCACT ACCACCAAGG 55 0 

GCTCCATCAC CTCTGTGCAR GCCGTCTACG TCCCCG 586 

25 

2 ) INFORMATION FOR SEQ ID NO: 484 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1145 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia furfur 
40 (B) STRAIN: ATCC 42132 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 484 

GCGTGGCCAG AAGGTCATTG ACACTGGTGC TCCCATCACC ATCCCCGTCG 50 

45 GTGGTGCCAC ACTGGGTCGT ATTCTGAACG TCACGGGTGA CCCTATTGAC 100 

GAGCGTGGCC CCGTTAAGAC TGACGTTTTC CGCCCCATTC ACCGTGACCC 150 

CCCTGCCTTT GTCGAGCAGT CGACTGATGC CGAGATTCTC GAGACTGGTA 200 

TCAAAGTCGT TGACCTGATT GCCCCTTACG CCCGTGGTGG TAAGATTGGT 250 

CTGTTCGGTG GTGCCGGTGT CGGTAAGACC GTGCTTATCC AGGAGCTCAT 300 

50 CAACAACATC GCCAAGGCCC ACGGTGGTTT CTCCGTGTTC ACTGGTGTCG 350 

GTGAGCGTAC TCGTGAGGGT AACGATTTGT ACCACGAGAT GATTGAAACC 400 

GGTGTCATTA ACCTCGAGGG TG AC TCGAAG GTGGCTCTGG TGTTCGGTCA 450 

GATGAACGAG CCCCCGGGTG CCCGTGCGCG TGTCGCTCTT AC TGGTCTG A 500 

CTGTGGCCGA GTACTTCCGT GACGACGAGG GCCAGGATGT GCTGCTGTTC 550 

55 ATTGACAACA TTTTCCGTTT CACCCAGGCC GGTTCGGAGA CTTCGGCTCT 600 

GCTGGGTCGT ATCCCTTCGT CGGTCGGTTA CCAGCCCACT TTGTCGACCG 650 

ATATGGGTGC CATGCAGGAG CGTATCACCA CCACCAAGAA GGGTTCGATT 700 

ACGTCGGTGC AGGCCGTCTA CGTGCCCGCC GATGATGTCA CTGACCCTGC 7 50 

CCCTGCCACT ACCTTCGCCC ACCTTGACGC TACCACTGTG CTTGACCGTT 800 

60 CGATCGCTGA GCTGGGTATC TACCCCGCTG TTGACCCGTT GAACTCGAAC 850 

500 
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TCGCGTATGC TTGACCCCGC TATTGTGGGT 
CTCTGGCGTG CAGAAGCTGC TCCAGGACTA 
TTGCCATTCT GGGTATGGAT GAGCTTTCTG 
GAGCGTGCCC GTAAGATGCA GCGTTTCATG 
5 CCAGGTCTTT ACTGGTATCG AAGGTCGTCT 
TCAAGGCCTG CAAGGAGATC CTGTCGGGCA 



CAGGAGCACT ACGACGTGGC 900 

CAAGTCGCTT CAAGATATCA 95 0 

AGGAGGACAA GCTCACTGTC 1000 

TCGCAGCCTT TCGCTGTCGC 105 0 

TGTTGCCCTG AAGGACACGA 1100 

AGCACGACAA CCTCC 1145 



10 2 ) INFORMATION FOR SEQ ID NO: 485 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1261 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia pachydermatis 

(B) STRAIN: ATCC 42756 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 485 



TCCCGCCATC TTCAACGCCC TGGAGGTCCA GGACATGAAG AACGGTGGCC 50 

GC CTTGTTCT GGAGGTTGCC CAGCACCTTG GTGAGAACAC TGTTCGTTGC 100 

ATTGCTATGG ACGGTACCGA GGGTCTTGTC CGTGGTCAGA AGGTCCTTGA 15 0 

CACTGGTGCC CCGATCACTA TCCCTGTCGG TAACGGTACC TTGGGCCGYA 2 00 

3 0 TCCTGAACGT CACTGGTGAG CCTGTGGATG AGCGTGGTCC GGTTAAGACT 250 

GACGTCTACC OTCCAATCCA CCGTGAGCCC _CCGGCGTTCG TTGAGCAGTC 300 

GACTGATGCT GAGATTCTTG AGAC TGGTAT CAAGGTGGTC GACCTGCTCG 35 0 

CCCCATACGC TCGTGGTGGT AAGATTGGTC TWTTCGGTGG TGCYGGTGTY 400 

GGTAAGACCG TGCTGATTCA GGAGCTTATC AACAACATTG CCAAGGCCCA 450 

3 5 CGGTGGTTTC TCGGTGTTCA CTGGTGTCGG TGAGCGTACT CGTGAGGGTA 500 

ACGATCTGTA CCATGAAATG ATTGAGACTG GTGTCATCAA CGTTGACGGT 550 

GACTCGAAGG TCGCTCTCGT GTTCGGTCAG ATGAACGAGC CCCCGGGTGC 600 

CCGTGCCCGT GTCGCCCTGA CYGGTCTGAC CATCGCCGAG TACTTCCGTG 65 0 

ACGACGAGGG TCAGGATGTG CTGCTCTTCA TTGACAACAT TTTCCGTTTC 7 00 

40 ACTCAGGCTG GTTCGGAGAC TTCGGCTCTG CTGGGTCGTA TCCCGTCGGC 750 

TGTCGGTTAC CAGCCKACCC TTGCCACGGA TATGGGTGCC ATGCAGGARC 800 

GTATYACCAC CACCAAGAAG GGTTC GATT A CCTCGGTGCA GGCYGTTTAC 85 0 

GTTCCGGCCG ACGATGTGAC TGACCCTGCC CCGGCCACGA CCTTCGCCCA 900 

CTTGGACGCC ACCACGGTGT TGGACCGTTC GATTGCYGAG CTGGGTATCT 950 

45 ACCCGGCCGT CGACCCGCTG AACTCGAAGT CGCGTATGCT TGACCCGTCA 1000 

ATTGTCGGTG TGGAGCACTA CAACGTTGCT TCGGGTGTCC AGAAGCTTCT 1050 

YCAGGACTAC AAGTCGCTCC AAGATATCAT TGCCATTCTG GGTATGGATG 1100 

AGTTGTCGGA GGAGGACAAG CTCACTGTCG AGCGTGCCCG TAAGATGCAG 115 0 

C GTTTCCTGT CGCAGCCTTT CGCTGTGGCC CAGGTCTTCA CTGGTATCGA 1200 

50 GGGTCGTCTT GTGTCGCTCA AGGACACCAT CAACGCCTGC AAGGAGATTC 1250 

TGTC CGGTAA G 1261 



55 2 ) INFORMATION FOR SEQ ID NO : 486 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1282 bases 

(B) TYPE: Nucleic acid 

6 0 (C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Metschnikowia pulcherrima 

(B) STRAIN: DSM 7 0336 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 486 

10 

AGGAGGGCAA CTTGCCAGCT ATCTTCAACG CTTTGACGTT GAAGAACGGC 50 

GACCAGAAGT TGGTCTTGGA GGTGGCCCAG CACTTGGGTG AGAACACCGT 100 

CAGAACCATT GCCATGGACG GTACCGAGGG TTTGGTCAGA GGCGCCTCTG 150 

TCACCGACAC YGGTGCCCCT ATCTCCGTGC CTGTCGGCCG TGAGACCTTG 2 00 

15 GGTCGTATTA TCAATGTTGT TGGTGAGCCA ATCGACGAGA GAGGCCCAAT 2 50 

CAACACCAAG AAGAGAAACC CTATTCACAC CGACCCACCT TCGTTTGTCC 300 

AGCAATCCAC TTCCGCCGAG GTCTTGGAGA CTGGTATCAA GGTTGTCGAC 3 50 

TTGTTGGCCC CTTACGCCAG AGGTGGTAAG ATTGGTTTGT TCGGTGGTGC 400 

CGGTGTCGGT AAGACCGTGT TCATCCAGGA GTTGATTAAC AACATTGCCA 450 

2 0 AGGCCCACGG TGGTTTCTCC GTTTTC AC CG GTGTCGGTGA GAG AAC C AG A 5 00 

GAGGGTAACG ATTTGTACCG TGAAATGCAG GAGAC TGGTG TCATCAACTT 550 

CGAGGGTGAC TCCAAGGTCG CCTTGGTTTT CGGTCAGATG AACGAGCCTC 600 

CAGGAGCTAG AGCYAGAGTT GCCTTGACCG GTTTGACCAT CGCCGAGTAC 650 

TTCAGAGACG AGGAGGGTCA GGATGTGTTG TTGTTCGTCG ACAACATTTT 7 00 

25 C AGATTC AC C CAAGCCGGTT CTGAGGTGTC TGCCTTGTTG GGTCGTATTC 750 

CATCCGCTGT CGGATACCAG CCAACCTTGG CCACCGATAT GGGTGCCTTG 800 

CAGGAGAGAA TTACCACCAC CAAGAAGGGT TCCGTCACCT CCGTCCAGGC 850 

CGTCTACGTG CCAGCCGATG ACTTGACTGA CCCTGCGCCA GCCACCACTT 900 

TCGCCCACTT GGACGCCACC ACTGTGTTGT CCAGAGGTAT CTCTGAGTTG 950 

3 0 GGTATCTACC CCGCTGTCGA CCCCTTGGAC TCCAACTCCA GATTGTTGGA 1000 

CGCCACCGTT GTTGGCCAGG AGCACTACGA CGTCGCCACC AACGTCCAGC 1050 

AAACTTTACA AG C TT AC AAG - TCCTTGCAGG ATATGATTGC CATTTTGGGT 1100 

ATGGATGAGT TGTCCGAGAC CCGACAAGTT GACCCGTCGA GAGAGCCAGA 1150 

AAGATCCAGA AGTTCTTGTC CCAGCCATTT GCCGTCGCCG AGGTTTTCAC 12 00 

3 5 CGGTATTGAG GGTAGATTGG TTAGATTGGA GGACACCGTT AGATCCTTTA 12 50 

AGGAGGTTTT GGAAGGTAAG TACGACCACT TG 12 82 



4 0 2 ) INFORMATION FOR SEQ ID NO: 487 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 482 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 

(B) STRAIN: WSA-214 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 487 



TGTCTTTATC CAGGAGTTGA TTGTACGTCT TTACCTTTCT GCCTGACTGT 50 

TTACGACAAC TAACGAAAGC GTAGAACAAC ATTGC CAAGG CTCACGGTGG 100 

TTACTCTGTC TTCACTGGTG TCGGTGAACG TACTCGTGAG GGTAACGATT 150 

TGTACCACGA AATGCAGGAA ACTGGTGTCA TTCAGCTCGA GGGTGAATCC 2 00 

60 AAGGTCGCCC TCGTGTTCGG TCAGATGAAC GAGCCCCCCG GTGCCCGTGC 250 

502 
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CCGTGTCGCT CTTACTGGTT 
AAGGTCAGGA CGTGCTTCTC 
GCCGGTTCTG AGGTGTCTGC 
TTACCAGCCC ACCCTTGCCG 
5 CCACCACCAC CAAGGGTTCC 



TGACCATTGC CGAGTACTTC 
TTCATTGACA ACATTTTCCG 
CCTTCTGGGT CGTATCCCCT 
TCGACATGGG TATCATGCAG 
ATCACCTCCG TC 



CGTGACGAGG 300 
TTTCACTCAG 3 50 

CTGCCGTCGG 4 00 

GAGCGTATTA 4 50 

482 



10 



2 ) INFORMATION FOR SEQ ID NO: 488 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1290 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

2 0 (A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 18205 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 488 

25 TCGAACAAGG TAACTT AC C A GCTATCTTGA ATGCTTTAGA AATCAAAACC 50 

C C AAGTGGT A CTCCATTAGT TTTAGAAGTT GCTCAACATT TAGGTGAAAA 100 

CACTGTCAGA AC TATTGC TA TGGATGGTAC TGAAGGTTTA GTTCGTGGTG 15 0 

AACAAGTTAC TGATACTGGT TCTCCAATCA CTGTCCCAGT TGGTCGTGAA 200 

AC TTT AGGTC GTATTATCAA CGTTGTTGGT GAACCAATTG ATGAACGTGG 250 

3 0 TCCAATTAAC AC C AAAC AAA GAAACCCAAT TCACGCTGAA CCACCTTCAT 300 

TCAGTGAACA ATCAACTGCT GCTGAAGTTT TAGAAACTGG TATCAAAGTT 350 

GTYGATTTAT TAGCTCCATA CGCTAGAGGT GGTAAAATTG GTTTATTCGG 400 

TGGTGCCGGT GTCGGTAAAA C TGTCTTT AT CCAAGAATTG ATTAACAACA 45 0 

TTGC T AAAGC TCATGGTGGT TTCTCAGTTT TCACCGGTGT TGGTGAAAGA 500 

3 5 AC C AG AG AAG GTAACGATTT AT AC CG TGAA ATGAAAGAAA CTGGTGTTAT 550 

TAACTTGGAA GGTGATTCTA AGGTCGCTTT AGTTTTCGGT CAAATGAATG 600 

AACCACCAGG TGCTAGAGCT CGTGTTGCTT TAACTGGTTT GACCATTGCT 650 

GAATAC TTC A GAGATGAAGA AGGTCAAGAT GTCTTGTTAT TCGTTGATAA 700 

CATTTTCAGA TTCACCCAAG CCGGTTCAGA AGTTTCTGCC TTATTAGGTC 750 

40 GT ATTC CATC TGCTGTCGGT TATCAACCAA CTTTAGCAAC TGATATGGGT 800 

TTGTTACAAG AACGTATTAC CACCACACAA AAAGGTTCAG TTACTTCTGT 850 

CCAAGCTGTT T ATGTC C C AG CTGATGATTT AACAGATCCT GCTCCAGCTA 900 

CCACTTTCGC CCATTTGGAT GCTACTACTG TCTTGTCTCG TGGTATTTCA 950 

GAATTAGGTA TTTACCCAGC TGTCGATCCA TT AG ATTC T A AATCAAGATT 1000 

45 ATTAGATGCT TCAGTTGTTG GTCAAGAACA TTATGATGTT GCTACCAACG 1050 

TTCAACAAAC TTTACAAGCT TACAAATCTT TACAAGATAT TATTGC TATT 1100 

TTAGGTATGG ATGAATTGTC TGAACAAGAT AAATTGACTG TCGAAAGAGC 1150 

AAGAAAAATC CAAAGATTCT TATCTCAACC ATTTGCTGTT GCCGAAGTTT 1200 

TCACTGGTAT YCCAGGTAGA TTGGTTAGAT TAAAAGACAC TATCAAATCA 1250 

50 TTCAAAGATG TTTTGGAAGG TAAATATGAT C ACTTAC C AG 1290 



2 ) INFORMATION FOR SEQ ID NO : 489 

55 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1291 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 



503 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 2149 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489 

10 CCAATTCGAA CAAGGTAACT TACCAGCTAT CTTGAATGCT TTAGAAATCA 50 

AAACCCCAAG TGGTACTCCA TTAGTTTTAG AAGTTGCTCA ACATTTAGGT 100 

GAAAACACTG TCAGAACTAT TGCTATGGAT GGTACTGAAG GTTTAGTCCG 15 0 

TGGTGAACAA GTTACTGATA CTGGTTCTCC AATC AC TGTC CCAGTTGGTC 2 00 

GTGAAACTTT AGGTCGTATT ATCAACGTTG TTGGTGAACC AATTGATGAA 250 

15 CGTGGTCCAA TTAACACCAA ACAAAGAAAC CCAATTCACG CTGAACCACC 300 

TTCATTCAGT GAACAATCAA CTGCTGCTGA AGTTTTAGAA ACTGGAATCA 350 

AAGTTGTTGA TTTATTAGCT CCATACGCTA GAGGTGGTAA AATTGGTTTA 400 

TTCGGTGGTG CCGGTGTCGG TAAAACTGTC TTTATCCAAG AATTGATTAA 450 

CAACATTGCT AAAGCTCATG GTGGTTTCTC AGTTTTCACC GGTGTTGGTG 500 

2 0 AAAGAACCAG AGAAGGTAAC GATTTATACC GTGAAATGAA AGAAACTGGT 550 

GTTATTAACT TGGAAGGTGA TTCTAAGGTC GCTTTAGTTT TCGGTCAAAT 600 

GAATGAACCA CCAGGTGCTA GAGCTCGTGT TGCTTTAACT GGTTTGAC C A 650 

TTGC TGAAT A CTTCAGAGAT GAAGAAGGTC AAGATGTCTT GTTATTCGTT 7 00 

GATAACATTT TCAGATTCAC CCAAGCCGGT TCAGAAGTTT CTGCCTTATT 750 

25 AGGTCGTATT CCATCTGCTG TCGGTTATCA ACCAACTTTA GCAACTGATA 800 

TGGGTTTGTT ACAAGAACGT ATTACCACCA CACAAAAAGG TTCAGTTACT 850 

TCTGTCCAAG CTGTTTATGT CCCAGCTGAT GATTTAACAG ATCCTGCTCC 900 

AGCTACCACT TTCGCC C ATT TGGATGCTAC TACTGTCTTG TCTCGTGGTA 950 

TTTCAGAATT AGGTATTTAC CCAGCTGTCG ATCCATTAGA TTCTAAATCA 1000 

30 AGATTATTAG ATGCTTCAGT TGTTGGTCAA GAACATTATG ATGTTGCTAC 105 0 

CAACGTTCAA CAAACTTTAC AAGCTTACAA ATCTTTACAA GATATTATTG 1100 

CTATTTTAGG TATGGATGAA TTGTCTGAAC AAGATAAATT GACTGTCGAA 1150 

AGAGCAAGAA AAATCCAAAG ATTCTTATCT CAACCATTTG CTGTTGCCGA 1200 

AGTTTTCACT GGTATCCCAG GTAGATTGGT TAGATTAAAA GACACTATCA 1250 

35 AATCATTCAA AGATGTTTTG GAAGGTAAAT ATGATC AC TT A 1291 



2 ) INFORMATION FOR SEQ ID NO: 490 



40 



( i } SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 508 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Rhodotorula minuta 

(B) STRAIN: ATCC 10658 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 490 

55 CGTATTGATT CGTGAGTGGC CGTTCCCTTA CAGCAAGCTT ATAAAGGAGC 50 

GAAAAAGATC TGACATTCGG CTTATGTGCT ATACAGAGGA ACTCATCAAC 100 

AACGTCGCCA AGGCTCATGG TGGTTACTCT GTCTTCACCG GTGTCGGAGA 150 

GCGAACACGT GAAGGTAACG ATCTCTACCA CGAAATGATT GAAACCGGTG 2 00 

TCATTCAGCT CAAGAACGAC AAGTCCAAGG CCGCTCTGGT CTTCGGACAG 2 50 

60 ATGAACGAGC CCCCCGGAGC TCGTGCCCGT GTCGCTCTGA CCGGTCTCAC 300 

504 
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CATCGCCGAG 
TCGACAACAT 
CTCGGACGTA 
TATGGGTGGT 
CCTCCGTC 



TACTTCCGTG 
TTTCCGATTC 
TCCCATCTGC 
ATGCAAGAGC 



ACGTCGAAGG 
ACCCAGGCCG 
TGTCGGATAC 
GAATCACAAC 



ACAGGATGTG 
GTTCAGAGGT 
CAGCCCACAC 
CACCAAGAAG 



CTACTCTTCA 
ATCTGCCCTG 
TCTCAACCGA 
GGTTCGATTA 



2 ) INFORMATION FOR SEQ ID NO : 491 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 686 bases 
<B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhodotorula mucilaginosa 

(B) STRAIN: ATCC 66034 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 491 

TGTCCTCATT CAGGAGCTCA TCAACAACAT CGCCAAGGCC CACGGTGGTT 
ACTCGGTCTT CACCGGTGTC GGCGAGCGTA CCCGTGAGGG TAACGGTGAG 
TCTCCCCCTT C AAAC TTTTG GCCGGCTAGT TGGCGCAGCG CAAACTGACG 
CGCGCGCCCT GTCCAGACTT GTACCACGAG ATGATCGAGA CTGGTGTCAT 
CCAGCTCGAG AACGACAACT CGAAGTGCGC TCTCGTGTTC GGCCAGATGA 
ACGAGCCCCC TGGTGCCCGT GCCCGTGTCG CTCTCACTGG GTTCGTCCTT 
TCTCTCTCTC GAGCGTCCTG GCTTGATACG GAACGCTGAC ACGTCACGCA 
-GTCTCACTAT -TGCTGAGTAC -T-TCCGTGACG AGGAGGGCCA GGACGTGCTC 
CTCTTCATCG ACAACATCTT CCGTTTCACC CAGGGTGAGC CGCCTCCGCG 
GGCATTCTCC CGTTTCTTTC GCGCTGACGT CTGTCCCGTA TAGCCGGTTC 
GGAGGTGTCT GCCCTTCTCG GACGTATCCC GTCCGCTGTC GGATACCAGC 
CGACTCTCTC GACCGACATG GGTCAGATGC AGGAGCGTAT CACGTAAGTT 
TGGCCGCAGC TCCGTCCGCG GCGCCCTTTG TGTCTGACCG TGTTCCACCG 
CTCAGCACCA CCAAGAAGGG CTCGATCACC TGTGTC 



2 ) INFORMATION FOR SEQ ID NO: 492 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 625 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporobolomyces salmoni color 

(B) STRAIN: ATCC 32311 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 492 

TGTCTTGATT CAGGAGCTCA TCAACAACGT CGCCAAGGCG CACGGTGGTT 
ACTCCGTTTT CACCGGTGTC GGTGAGCGTA CCCGTGAGGG TAACGGTGAG 
CACACTTTCG CCGACCTGGC CATTCCGGAA CGTGCGGACT GACGAGGACC 

505 



CA 02307010 2000-05-19 



ACCTCGAACA GATCTCTACC ACGAGATGAT 
TCGACAACGA CAAGTCGAAG ACTGCTCTCG 
CCCCCTGGCG CCCGTGCCCG TGTCGCTCTT 
GTACTTCCGT GACGACGAGG GCCAGGACGT 
5 TCTTCCGTTT CACCCAGGGT ACGTTCGATC 
TCGTGGTGAC TGACAACCTG TTGCGCGTGC 
CCCTTCTCGG TCGTATCCCG TCCGCTGTCG 
ACCGACATGG GTGGCATGCA GGAGCGTATC 
TCTCTCGTTT CGCTCTGCAT CGCTCACGCA 
10 CAAGAAGGGT TCGATCACCT GTGTC 



CGAGACCGGT GTCATTCAGC 200 

TCTTCGGCCA GATGAACGAG 25 0 

ACTGGTCTCA CCATCGCGGA 300 

GCTTCTCTTC ATCGACAACA 35 0 

CCGCCCGTCC AACACGAATG 400 

AGCCGGTTCG GAGGTGTCTG 450 

GATACCAGCC CACTCTCTCG 500 

ACGTACGCCC TCTTCTGCTT 550 

TGTTCGCCCG ACAGCACCAC 600 

62 5 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2 ) INFORMATION FOR SEQ ID NO: 493 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1211 bases 

(B) TYPE: Nucleic acid 

( c ) S TRANDEDNE S S : Double 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: u , 

(A) ORGANISM: Sporothrlx schenckn 

(B) STRAIN: WSA-148 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493 

TCTCGTCCGT GGTGCTAAGG CCACTGACAC_TGGTTCCCCC 
CCGTCGGCCC CGGTACCCTC GGTC6CATCA TGAACGTCAC 
ATCGACGAGC GCGGTCCCAT CAAGACCGAC AAGTTCCGTC 
TGAGGCTCCC GAGTTCGTTG ACCAGTCGAC CACCGCTGAG 
CTGGTATCAA GGTCGTCGAT CTGCTTGCTC CCTACGCCCG 
ATTGGTCTGT TTGGCGGTGC CGGTGTTGGC AAGACCGTGT 
GCTCATCAAC AACATCGCCA AGGCCCACGG TGGTTACTCC 
GTGTCGGCGA GCGTACCCGT GAGGGTAACG ATCTGTACCA 
GAGACCTCTG TCATTCAGCT TGACGGTGAC TCCAAGGTCG 
CGGTCAGATG AACGAGCCCC CTGGTGCTCG TGCCCGTGTC 
GTTTG AC TGT CGCTGAGTAC TTCCGTGACG AGGAGGGC C A 
TTTTGAATTA TTTCCTTGTC GTACAGTTCC AAATCGAAGA 
TGTCAGTGCT TCTCTTCATC GACAACATTT TCCGCTTCAC 
TCTGAGGTGT CTGCCCTTCT GGGTCGTATT CCCTCCGCTG 
GCCCACGCTC GCCGTGGACA TGGGTCTGAT GCAGGAGCGT 
CCCGCAAGGG CTCAATTACC TCCGTCCAGG CCGTCTACGT 
GATCTGACGG ATCCCGCCCC CGCCACCACC TTCGCCCATC 
CACTGTGCTG TCCCGAGGTA TCTCTGAGCT GGGTATCTAC 
ACCCCCTCGA CTCCAAGTCG CGTATGCTGG ACCCCCGTAT 
GACCACTACG AGACCGCCAC TCGCGTCCAG CAGATCCTCC 
GTCGCTGCAG GACATCATCG CCATTCTGGG TATGGACGAG 
CCGACAAGCT TACAGTCGAG CGTGCTCGTA AGATCCAGCG 
CAGCCGTTCA CGGTCGCGCA GGTCTTCACT GGTATCGAAG 
CGATCTGAAG GACACTATCG CTTCGTTCAA GGCTATCCTG 
GTGACAGCCT T 



ATTACCATCC 50 

CGGTGACCCG 100 

CCATCCACGC 150 

GTTCTCGTGA 200 

TGGTGGTAAG 2 5 0 

TCATCCAGGA 300 

GTCTTCACCG 3 50 

CGAAATGCAG 400 

CCCTGGTGTT 45 0 

GCCTTGACCG 500 

GGATGGTATG 550 

ATTACTAACT 600 

CCAGGCCGGT 650 

TCGGTTACCA 7 00 

ATTACCACCA 750 

GCCCGCTGAC 800 

TGGACGCCAC 850 

CCCGCTGTCG 900 

TGTCGGTGAC 950 

AAGAGTACAA 1000 

CTGTCTGAGG 1050 

TTTCCTGAGC 1100 

GCCAGCTGGT 115 0 

AGC GGTGAGG 12 00 
1211 



2 ) INFORMATION FOR SEQ ID NO : 494 

60 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1133 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

{ vi ) ORIGINAL SOURCE: 
10 (A) ORGANISM: Stephanoascus ciferrii 

(B) STRAIN: ATCC 52550 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494 

15 TC TTGTTAG A GGCACCCCAG TCAAGGACAC TGGTGCTCCA ATTACCATTC 5 0 

CAGTTGGTAA CGGCACTTTG GGCCGTATCG TC AACGTTC T CGGTGAGCCA 100 

ATTGATGAGC GTGGACCAGT CAAGGCTGAC AAGTTCAGAC CTATTCACGC 150 

TGAGCCACCA ACCTTCGCTG ACCAGTCCAC CTCTGCCGAG GTTCTTGAGA 200 

CCGGTATTAA GGTTGTCGAC TTGCTTGCCC CTTATGCCAG AGGTGGTAAG 250 

2 0 ATTGGTCTTT TCGGTGGTGC CGGTGTCGGT AAGACTGTGT TCATTCAGGA 300 

GCTTATTAAC AACATTGCTA AGGCCCACGG TGGTTACTCT GTCTTCACTG 350 

GTGTCGGTGA GCGAACTCGT GAAGGTAACG ATTTGTACCA CGAAATGATG 400 

GAGACCGGTG TCATCAACCT TGAGGGTGAC TCCAAGGTGT CTCTTGTGTT 450 

CGGTCAGATG AACGAGCCTC CAGGAGCCCG TGCCCGTGTT GCCTTGACCG 500 

25 GTTTGACCAT TGCCGAGTAC TTCAGAGATG AGGAGGGCCA GGATGTCTTG 55 0 

TTGTTCATTG ACAACATTTT CCGATTCACC CAGGCCGGTT CTGAGGTCTC 600 

TGCCTTGTTG GGTCGTATCC CATCTGCCGT CGGTTACCAA CCAACCTTGG 650 

CTACTGATAT GGGTGGTCTT CAAGAACGTA TTACCACCAC TCAAAAGGGT 700 

TCCGTCACCT CTGTCCAGGC TGTCTACGTC CCAGCTGACG ATTTGACTGA 750 

3 0 TCCTGCCCCA GCTACCACCT TCGCCCATTT GGACGCCACC ACCGAATTGT 800 

CCCGATCTAT CTCTGAGTTG GGTATCTACC CAGCTGTCGA CCCTCTTGGT 85 0 

TCCAAGTCCC GTCTTTTGGA TGCCTCCGTC GTCGC3CCAAG AGCACTACGA 900 

CGTTGCCGCC AACGTCCAAC AGACCTTGCA GGCCTACAAG TCTCTCCAGG 950 

ATATCATTGC CATTTTGGGT ATGGACGAAT TGTCTGAGGC TGATAAGCTC 1000 

3 5 ACTGTCGAGC GTGCTCGTAA GATGCAGAGA TTCCTTTCTC AGCCATTCAC 105 0 

CGTCGCTGAG GTCTTCACTG GTCTCGAGGG TAGACTCGTT TCTTTGAAGG 1100 

ACACCATCCG ATCCTTCAAG GAGATCCTTG ACG ^ 113 3 



40 

2 ) INFORMATION FOR SEQ ID NO: 495 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 608 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: WSA^225 



55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 495 

™>TT™ T GTAA GTCATT TGAAACCCAG CCCCAAGAAA CAGAAGCTAG 50 

^AAAATTG GACAATTGAG CAATTTAGCC ATTGGAGAAA AGAAATTTCG 100 

AGTATTAATT GTTTTTATAG AACAACATTG CCAAGGCTCA CGGTGGTTAC 150 

60 TCTGTCTTCA CTGGTGTCGG AGAGCGTACC CGTGAAGGAA ACGATCTCTA 200 
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CCATGAAATG CAGGAGACCC GTGTCATTCA GCTTGATGGC GAGTCCAAGG 
TCGCCCTGGT CTTCGGCCAG ATGAACGAGC CCCCAGGTGC CCGTGCCCGT 
GTTGCTCTTA CTGGTTTGAC CATTGCTGAG TACTTCCGTG ATGAGGAAGG 
TCAAGACGGT GAGTTTCTTA TGGATAAAAA AAAATTTTTT TTTTTTTTTT 
TTTTTTTCAA GAAATTCATG TTCTAACAAA GTGTATTCTA GTGCTTCTCT 
TCATCGACAA CATTTTCCGT TTCACTCAGG CTGGTTCCGA AGTGTCTGCC 
CTGCTTGGTC GTATCCCATC TGCCGTCGGT TACCAACCCA CTCTTGCCGT 
CGACATGGGT GGTATGCAGG AACGTATTAC CACCACCAAG AAGGGATCCA 
TTACCTCC 



2 ) INFORMATION FOR SEQ ID NO: 496 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 794 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Wangiella dermatitidis 

(B) STRAIN: WSA-229 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496 

GTTTATTCAA GAACTCATTG TGGGTGGCAT TCTCATAATG TTTCGGCCAC 
AATTACTGAT TGAAAATAGA ACAACATTGC AAAGGCTCAT GGTGGTTACT 
CCGTGTTCAC TGGTGTCGGC GAGCGAACTC GTGAGGGTAA CGACTTGTAC 
GACGAAATGC AGG AG AC C TC TGTCATTCAG CTCGATGGCG AGTCCAAGGT 
CGCGCTGGTG TTTGGTCAAA TGAACGAACC TCCTGGTGCT CGTGCTCGTG 
TTGCTCTGAC TGGGTAAGTT GTTCCTTCGC TTCTTGCGCG TATCCACATC 
CCCATCTTGA GAATACGTCT GCCACCATGT CATGTGATGT TGGGCTGGTT 
CTGGTTTTTG GGAGGCCCTC AAGTTCAATT TTTGGATGAC AGCACCAGCT 
TTACAAGATT ATGC TAACTT AATGGAGTCT TACGGTGGCT GAGTTCTTCA 
GGGATGAGGA GGGACAGGAT GGTAAGTTTG ATAACAATCT CGTCGGTGTC 
AATATCGACG GCGTACTCTT CGCATCAAAA AACCAAAGAG GTGGTTTGGT 
GTGAGAAGTG CGCCGGAAAT AATGGCAACC ACGTGACAAT GACCACGTGT 
GGGGCTCCCG TGCTAACACG TGACAGTCTT GCTCTTCATC GACAACATTT 
TCCGATTCAC TCAASCCGGT TCTGARGTGT CTGCCTTGCT TGGTCGTATT 
CCATCTGCCG TCGGTTACCA ACCCACACTC GCCGTCGACA TGGGTCTCAT 
GCAGGAACGT ATCACCACCA CCCGGAAGGG ATCCATCACA TCTG 



2 ) INFORMATION FOR SEQ ID NO: 497 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1148 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yarrowia lipolytica 

(B) STRAIN: ATCC 38295 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 497 



TCTTGTCCGA 
5 CCGTCGGCCG 
ATTGACGAGC 
TGACCCCCCT 
CCGGTATTAA 
ATTGGTCTCT 

10 GCTGATTAAC 
GTGTCGGTGA 
GAG AC TGGTG 
CGGTCAGATG 
GTCTGACCAT 

15 CTCTTCGTTG 
CGCTCTGCTT 
CCACCGATAT 
TCCGTCACTT 
TCCTGCTCCC 

20 CCCGAGGTAT 
TCCAAGTCTC 
TGTTGC TTC C 
ATATCATTGC 
ACCGTCGAGC 

2 5 CGTCGCCGAG 
ACACTGTCCG 



GGCACCGCCG 
AGGTACCCTT 
GAGGACCCAT 
ACCTTCGCTG 
GGTCGTCGAC 
TCGGTGGTGC 
AACATTGCCA 
GCGAACCCGA 
TCATCAACCT 
AACGAGCCTC 
TGCCGAGTAC 
ACAACATTTT 
GGTCGAATTC 
GGGTGCCCTC 
CCGTCCAGGC 
GCCACCACCT 
TTCCGAGCTG 
GACTTCTGGA 
AACGTCCAGC 
CATTCTTGGT 
GAGCTCGAAA 
GTTTTCACCG 
ATCCTTCAAG 



TCGCTGACAC 
GGTCGAATCA 
CGAGGCTTCC 
AGCAGTCTAC 
CTCCTCGCCC 
CGGTGTCGGT 
AGGCCCATGG 
GAGGGTAACG 
C G AGGGTG AG 
CCGGAGCCCG 
TTCCGAGACG 
CCGATTCACC 
CCTCCGCTGT 
CAGGAGCGAA 
CGTCTACGTG 
TCGCCCATCT 
GGTATCTACC 
TATCGATGTT 
AGACCCTCCA 
ATGGATGAGC 
GATCCAGCGA 
GTATTGAGGG 
GAGATCCTTG 



CGGTGCTCCC 
TCAACGTCTG 
AAGTACCTCC 
CTCCGCTGAG 
CTTACGCCCG 
AAGACTGTCT 
TGGTTTCTC C 
ATCTTTACCG 
TCTAAGGTCA 
TGCCCGAGTC 
AGGAGGGTCA 
CAGGCCGGTT 
CGGTTACCAG 
TTACCACCAC 
CCTGCCGATG 
TGACGCCACC 
CCGCTGTCGA 
GTCGGAAAGG 
GGCTTACAAG 
TGTCCGAGCA 
TTCCTGTCTC 
ACGACTTGTC 
ACGGTAAGCA 



ATCACTATCC 5 0 

TGGTGAGCCC 100 

CCATCCACGC 150 

GTTCTCGAGA 200 

AGGTGGTAAG 25 0 

TCATCCAGGA 3 00 

GTTTTCTGCG 350 

AGAGATGAAG 400 

CCCTCGTCTT 450 

GCCCTTACTG 500 

GGATGTGTTG 550 

CCGAGGTGTC 600 

CCCACTCTGG 65 0 

CCAGAAGGGT 700 

ATTTG AC CG A 750 

ACCGTCCTGT 800 

TCCCCTTGAT 85 0 

AGCACTACGA 900 

TCTCTCCAGG 950 

GGACAAGCTG 1000 

AGCCCTTCAC 1050 

TCTCTCAAGG 1100 

CGATGCTC 1148 



30 2 ) INFORMATION FOR SEQ ID NO: 498 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 966 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fvmigatus 

(B) STRAIN: WSA-172 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 498 



GCGCTATTGT CGTTGTTGCT GCCTCCGACG GTCAGATGTA GGTGGAACAT 50 

CTTGGGAAAT ACGTCGTAAA ACACGTCGCT TACGTTTTCG CGAATAGGCC 100 

CCAGACTCGT GAGCATTTGC TGCTCGCCCG CCAGGTTGGT GTCCAGAAGA 150 

TCGTTGTCTT CGTCAACAAA ATCGATGCTA TTGATGATCC GGAGATGCTG 200 

50 GAACTGGTCG AACTCGAGAT GCGTGAGCTG CTGAACAGCT ACGGTTTCGA 250 

GGGTGAAGAG ACTCCGATCA TTTTCGGTTC CGCTCTCTGT GCTCTCGAAG 3 00 

GACGCCGTGA CGACATCGGT AAAGACAGAA TTGAGCAGCT TATGAACGCT 350 

GTCGACACCT GGATCCCCAC TCCTCAGCGT GACCTCGACA AAC CTTTCTT 400 

GATGTC TGTC GAGGAAGTGT TCTCTATCGC CGGCCGTGGT ACCGTGGCTT 450 

55 CTGGTCGTGT CGAGCGTGGT ATCTTGAAGA AGGACTCTGA GGTTGAGATT 500 

GTTGGAGGCT CCTTCGAACC CAAGAAGACC AAAGTCACCG ACATTGAAAC 550 

CTTCAAGAAG AGCTGTGATG AATCGCGTGC TGGTGACAAC TCTGGTCTCC 600 

TCCTGCGTGG TATCCGACGT GAAGACGTCA AGCGTGGTAT GGTCATTGCT 650 

GTTCCCGGCA GCACCAAGGC TCACGACAAG TTCCTCGTCT CCATGTACGT 700 

60 CCTGACCGAG GCGGAGGGTG GTCGTCGTAC TGGCTTCGGT GCCAACTACC 750 
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GTCCCCAAGT CTTCATCCGT ACTGCAGGTA 
AGATCTTCCG AGAGATTAGC G AT ATATGC T 
TGCTGACCTC AGCTTCCCTG ACGGCGACCA 
GTGACAACGT CGAGATGATC CTGAAGACCC 
5 GCTGGTCAAC GCTTCA 



AGTTCCCGCA CACCGTGTCC 800 
AATGATTCAT CAGACGAGGC 85 0 

ATCTCGCAGA GTTATGCCTG 900 
ACCACCCTGT TGCTGCTGAG 9 50 

966 



2 ) INFORMATION FOR SEQ ID NO : 499 



10 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Blastoschizomyces capitatus 

(B) STRAIN: ATCC 10663 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 499 

25 TGGTGCTATT ATTGTTGTTG CTGCTTCTGA TGGTCAAATG CCCCAAACCC 50 

GTGAGCACTT GCTTCTTGCT CGTCAAGTTG GTGTTAAGCA CATTGTTGTT 100 

TTCGTTAACA AGATTGATAC TATTGATGAT CCTGAAATGT TGGAACTTGT 150 

CGAAATGGAA ATGAGAGAAC TTCTTTCTTC TTACGGTTTT GATGGCGATA 200 

ACACCCCTGT CATTATGGGT TCTGCTCTCT GTGCTCTTGA AGGTCGTGAA 2 50 

3 0 C C AG AAATTG GTGAACAAAG AATCAACCAA CTCCTTGATG CTATCGATGA 3 00 

ATACATTCCT ACCCCAGTTC GTGATATGGA CCAACCTTTC TTGATGCCAC 3 50 

TTGAAGGTGT TTTCTCTATT CCAGGTCGTG GTACTGTTGC CACTGGACGT 400 

GTCTATCGTG GTACTTTGAA GAGAGGTGAA GAAGTTGAAG TTGTTGGC T A 45 0 

CAATGATGCT CCAATCAAGA CCACCGTTAC TGGTATTGAA ATGTTCAAGA 500 

35 AGGAACTTGA TCAAGCTCAA GCTGGTGACA ACGCTGGTAT TCTTTTGAGA 550 

GGTGTTAAGC GTGAAGACCT TAAGCGTGGT ATGGTTGTTG CTAAACCAGG 600 

TACCGTTAAG CCACACACCA AGTTCCTTGC CTCCATCTAT GTTTTGACTA 65 0 

AGGAAGAAGG TGGCAGACAC TCTGGCTTTG GTCTTAACTA CAGACCTCAA 700 

CTTTTCCTTG GTTCTGCTGA TGTTACCACT GTCTTGACCT TCCCAGAGGG 7 50 

40 TGTTGACCAA AGCACTCAAG TCATGCCAGG TGACAACACT GAAATGGTTT 800 

GCGAACTTGT TCACCCAGTT GCTGTCGAAC AAGGCCAACG TTTCAA 846 



45 2 ) INFORMATION FOR SEQ ID NO: 500 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida rugosa 

(B) STRAIN: ATCC 96275 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500 
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GGTGCCATTA 
TGAGCATCTT 
TTGTTAACAA 
GAGATGGAGA 
5 CTCTCCTGTC 
CCGAGATTGG 
TGGATCCCTA 
TGAGGCCTCT 
TCGAGCGTGG 

10 AACAAGCAGC 
GGAACTTGAT 
GTATTCGTCG 
ACTGTTAAGG 
GGAAGAGGGC 

15 TGTTCGTTTC 
GTTGAACAGC 
CGAGCTCGTT 



TTGTTGTTGC 
TTGCTTGCCC 
GATTGATACC 
TGCGTGAACT 
ATTATGGGCT 
TAAGGACCGT 
CCCCCGAGCG 
TTCTCCATTT 
TATTCTCAAG 
CCCTGAAATC 
CAGGCCCAGG 
TGAGGACTTG 
CTCACACCAA 
GGCCGTCACT 
TGCAGCTGAT 
ACACTCAGGT 
CACCCTACTG 



TGCCTCTGAT 
GCCAAGTCGG 
ATTGATGACC 
GTTGAATGAA 
CTGCTCTTGC 
ATCATGCAGC 
TGACCTTGAC 
CTGGTCGTGG 
AAGGGTGAGG 
TGTTGTTACT 
GCGGTGATAA 
CAGCGTGGTA 
GTTCCTTTCC 
CTCCTTTCGG 
GTC AC CGTTA 
CTTCCCTGGT 
CTATTGAGGT 



GG AC AGATGC 
TATGCAAAAG 
CTGAAATGCT 
TATGACTTCG 
TGCTCTTGAG 
TCTTGGACGC 
AAGCCTTTCA 
TACTGTTGCC 
AAGTC GAG AT 
GGTATTGAAA 
TGCTGGTATC 
TGGTTTTGGC 
TCCATCTACG 
TATGAACTAT 
CTCTTACTTT 
GAGAACACCG 
TGGTCAACGC 



CCCAGACCCG 50 

GTCGTTGTGT 100 

TGATCTTGTC 15 0 

ATGGAGATAA 200 

GACAAGAACC 250 

TGTTGATGAA 300 

TGATGCCTAT 35 0 

ACTGGCCGTG 400 

CGTTGGTTTC 45 0 

TGTTCAAGAA 500 

TTGCTTCGTG 55 0 

CAAGCCTGGA 600 

TTCTCTCCAA 650 

CGTCCCCAGA 700 

CCCTGAGGGT 750 

AGATGGTTGG 800 

TTCAAC 84 6 



20 
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2 ) INFORMATION FOR SEQ ID NO : 501 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 944 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coccidioides iwmitls 

(B) STRAIN: Silveira 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501 

AGTTGTCGTC GTTGCTGCTT CAGACGGTCA AATGTATGCA ACCG AG AG C A 50 

CTCCCGGATC TTGGTTTAAA TGGCACTAAT ATAAGACAGG CCTCAAACTC 100 

GAGAGCATTT ACTTCTCGCC CGTCAGATCG GTATCCAAAA AATCGTCGTC 15 0 

40 TTCGTGAACA AGGTTGATGC CATCGAGGAC AAAGAGATGT TGGAGCTTGT 200 

TGAATTGGAG ATGCGTGAAC TCCTAACCAG CTACGGTTTC GAGGGTGAAG 25 0 

AAACTCCCAT CATTTTTGGC TCTGCTCTCT GTGCCCTCGA AGGAAGACAA 3 00 

CCCGAGATCG GTGTTACCAA GATTGATGAG CTCTTGCAGG CCGTCGACAC 35 0 

CTGGATTCCC ACTCCTCAGC GTGAGAC TG A CAAGCCCTTC TTGATGTC C A 400 

45 TTGAGGAAGT GTTCTCTATT TCCGGACGAG GAACCGTTGT CTCCGGCCGT 450 

GTGGAGCGTG GTATCCTCAA GAAGGACTCC GAAGTTGAAA TTGTCGGCGG 500 

TTCGCCCGAG CCAATCAAAA CCAAGGTTAC C G AT ATCGAA ACCTTTAAGA 550 

AGTCTTGCGA CGAGTCTCGC GCTGGTGATA ACTCCGGCTT GCTCCTACGA 600 

GGCGTTAAGC GTGAAGATAT TAGCCGTGGC ATGGTCGTCG CTGTACCAGG 65 0 

50 AAGTGTCAAG GCCCATACTG AATTCTTAGT TTCGCTTTAC GTCCTCACCG 7 00 

AAGCTGAGGG TGGGCGCAAA TCTGGATTCA GCAGCAAGTA CCGCCCACAG 750 

ATGTTCATTC GCACTGCCGG TATGTAATAC TGTGATAATT TCGTTGACAT 800 

GGTACTGATT GAATTCTATA GACGAAGCGG CTCAGCTCAG CTGGCCCGGA 85 0 

GAAGATCAAG ACAAGATGGC TATGCCAGGA GACAATATCG AAATGATTTG 900 

55 CACCACCTTG CACCCAGTTG CCGCCGAGGC TGGCCAGCGA TTCA 944 



2 ) INFORMATION FOR SEQ ID NO: 502 

60 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 849 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Fusarium oxysporum 

(B) STRAIN: WSA-212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502 

15 GCTATCATCG TCGTTGCTGC CTCCGATGGA CAGATGCCCC AGACCCGTGA 50 

ACACTTGCTC CTCGCTCGTC AGGTCGGTGT CCAGCGAATT GTCGTCTTCG 100 

TCAACAAGGT CGATGC C ATT GATGACCCCG AGATGCTTGA GCTCGTCGAG 150 

ATGGAGATGC GCGAGCTTCT TAACACCTAC GGC TTCGAAG GCGACGACAC 200 

TCCCGTCATC ATGGGCTCTG CTCTTATGTC TCTCCAGAAC CAGCGCCCCG 2 50 

20 AGATTGGCAC C G AG AAGATC GATGAGCTCC TTGCTGCCGT CGACGAGTGG 3 00 

ATCCCAACCC CCGAGCGTGA CCTTGACAAG CCCTTCCTTA TGTCCGTCGA 350 

GGATGTCTTC TCCATTGCCG GCCGTGGTAC CGTCGTGTCT GGCCGTGTGG 400 

AGCGTGGTAT TCTGAAGCGT GATCAGGAGA TCGAGCTTGT CGGAAAGGGG 45 0 

CAGGAGGTTA TCAAGACCAA GGTTACCGAC ATTGAGACCT TCAAGAAGTC 500 

25 TTGTGAGCAG TCCCAGGCTG GTGACAACTC TGGTCTCCTC ATCCGAGGTG 550 

TTCGCCGTGA GGATGTCCGC CGTGGTATGG TTGTCTGCGC TCCTGGCACC 600 

GTCAAGTCTC ATACCCAGTT TCTCGCTTCC CTCTACGTCC TCACCAAGGA 650 

GGAGGGTGGC CGACACACCG GTTTCCAGGA GCACTACCGA CCCCAGCTCT 700 

ATCTCCGAAC TGCAGATGAG TCCATTGACC TGACTTTCCC CGAGGGTACT 7 50 

3 0 GAGGATGCCT C C AGTAAG AT GGTCATGCCT GGTGACAACA CCGAGATGGT 800 

—TGTC ACC ATG— GGTC ACC C C A . ATGCC ATCGA..GGTTGGTCAG _C G ATTC AAC 849 



35 2) INFORMATION FOR SEQ ID NO: 503 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1064 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulatum 

(B) STRAIN: G186A5 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503 



TGGTGCTATT GTCGTTGTTG CTGCTGCTGA CGGCCAAATG TAAGACGCCG 50 

CGAGGGACTG CTGAGGGTTT TATGCTTTTT AGGCCCCCTT GTTTCTGAGA 100 

GCATGATGAT ACTAATATTC GGAAACGTAT CTATTAGGCC TCAAACACGT 150 

GAGCATTTGC TCCTTGCCCG ACAGGTCGGT GTCCAAAAGA TCGTCGTTTT 200 

55 CGTGAACAAA GTCGACGCCC TTGAGGACAA GGAGATGTTG GAGCTTGTCG 250 

AGTTAGAAAT GAGAGAGCTC TTAAACACCT ACGGCTTCGA GGGTGAAGAG 3 00 

ACACCCATCA TCTTTGGTTC TGCCCTTTGC GCCATGGAAG GCCGTGAGCC 3 50 

TGAGTTGGGA GAAAAGAAAA TTGATGAATT GCTGGAGGCT GTTGATACTT 400 

GGATCCCCAC ACCACAACGT GATACCGAAA AACCTTTCTT GATGTCCGTT 450 

60 GAGGAAGTAT TCTCTATCTC CGGCCGTGGA ACCGTTGCCT CCGGTCGTGT 500 

512 
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TGAGCGCGGT GTCCTCAAGA AGGATTCAGA AGTCGAGCTA ATTGGGGGCG 550 

GCTCCACCCC CATCAGGACG AAGGTAACTG ATATCGAAAC TTTCAAGAAA 600 

TCCTGTGACG AGTCTAGAGC TGGGGACAAC TCCGGTCTTT TATTGCGTGG 650 

TATCAAGCGT GAAGATATCC GCCGTGGTAT GGTAGTTGCC GTTCCTGGCA 7 00 

5 GCGTCAAGGC CCACGACAAG TTCTTGGTGT CGATGTATGT CCTGACCGAA 7 50 

GCTGAGGGTG GTCGCCGAAC CGGATTCGGC CAGAACTATC GTCCTCAAAT 800 

GTTCATCCGC ACAGCTGGTA TGTCAAAATG GGACCCCTTT TCATAATCCT 850 

TTCTTTTTTT CCTTTTCCTC TCTATCTCTC TTTCTGTTTC CTTTCAACTC 900 

GCCTGATTCA CGAAATTAAC TAACCCGTTT GATTATAGAC GAAGCCGCCC 950 

10 ATCTCAGCTT CC CTAGTGG A G C AG ATG AAA GCAAACTCGT TATGCCTGGT 1000 

GACAACGTCG AGATGATCCT CCAGACACAC CGCCCCGTGG CTGCTGAGGC 1050 

CGGCCAGCGA TTCA 1064 



15 



2 ) INFORMATION FOR SEQ ID NO: 504 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 982 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 

(B) STRAIN: ATCC 32071 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: .504 

TGGTGC TATC GTCGTTGTTG CTGCCTCTGA CGGCCAAATG TAGGGATTTT 50 

GCAAGACTGG TGAAAAAATC TAAAGAAAAT AGAAAAGATT GTGCTGATGT 100 

TTGGTATCAG GCCCCAAACA CGCGAGCATC TGCTTCTTGC CAGACAAGTC 150 

35 GGTGTTCAGA AAATCGTTGT TTTCGTCAAC AAGGTC G ATG CTGTAGAGGA 2 00 

TAAGGAGATG TTGGAGCTTG TCGAATTGGA GATGAGAGAG CTCTTGAC C A 250 

CCTATGGGTT CGAGGGTGAG AAGACACCTA TCATCTTTGG TTCTGCGCTC 300 

TGTGCTATGG AGGGCCGTCA GCCCGAGTTG GGAGAGCAGA AAATTGATGA 350 

ATTACTCGAG GCTGTGGATA CTTGGATCCC TACGCCACAG CGTGATACTG 400 

40 ACAAGCCCTT CCTGATGTCC ATTGAGGAGG TGTTCTCTAT CTCTGGACGA 450 

GGAACCGTTG CCTCCGGCCG CGTTGAGCGT GGTATCCTCA AGAAGGACTC 500 

CGAAGTTGAA ATTATTGGCG GCGGTGTTCC CACAATCCTG ACCAAGGTGA 550 

CTGATATCGA AACCTTCAAG AAGTCTTGCG ACGAGTCCAG AGCCGGGGAC 600 

AACTCCGGCC TCTTGTTGCG CGGTGTCAAG CGTGAGGATA TCCGCCGTGG 650 

45 TATGGTCGTT GCAGTTCCCG GAAGCGTCAA AGCACATGAC AGATTCTTGG 7 00 

TGTCGATGTA CGTTCTGACC GAGGCTGAGG GTGGTCGCCG CACTGGCTTC 750 

GGTCAGAACT ATCGTCCTCA AATGTTCATC CGCACAGCTG GTACGTTCAT 800 

TCTTTCACTA TATTCCTATA TGCATAGCCC GATCCTCCCA TTAACTAATT 850 

GACACAGACG AGGCTGCTGA ACTCAGCTGG CCTGATGGAG ACGACGAAGC 900 

50 CAAAATGGTC ATGCC CGGTG ACAATGTTGA AATGGTCCTG AAGTCACACC 950 

GCCCGGGGCC GCTGAGGCTG GACAGCGATT CA 982 



55 2 ) INFORMATION FOR SEQ ID NO: 505 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 931 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

513 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penlcillium marneffei 

(B) STRAIN: ATCC 58950 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 505 

10 

CGCTGTTGTT GTCGTCGCTG CTTCTGATGG TCAAATGTAA CATATCCACG 50 

AGCTGCCAAT TATGGACACT GCTGATAAGA ATAGGCCCCA AACCCGTGAG 100 

CACTTGCTCC TCGCCCGTCA GGTCGGTGTT CAAAAGATCG TCGTCTTCGT 15 0 

CAACAAGGTT GATGCCGTCG AGGACCCCGA GATGTTGGAA CTTGTCGAAT 200 

15 TGGAAATGCG TGAACTCTTG ACCACCTACG GTTTCGAGGG TGAAGAGACC 25 0 

CCTATCATTT TCGGATCCGC TCTTTGCGCC TTGGAAGGCC GCAAGCCCGA 300 

GATTGGCGAA CAGAAGATTG ACGAGCTCAT GAACGC CGTT GATACCTGGA 350 

TCCCCACCCC CCAGCGTGAC CTTGACAAGC CC TTCTTG AT GTCCGTTGAG 400 

GAAGTTTTCT CCATCTCTGG TCGTGGTACC GTTGCATCTG GTCGTGTTGA 450 

20 GCGTGGTATT TTGCGCAAGG ATTCTGAGGT TGAGATTATC GGATACCAGA 500 

AGAACCCTAT CAAGACCAAG GTT AC CG AC A TTGAGACCTT CAAGAAGTCT 550 

TGCGATGAAT CTCGTGCTGG TGACAACTCT GGCTTGCTTC TCCGTGGTAT 600 

CAAGCGTGAG GACATTCGTC GTGGTATGGT TATCGCTGCT CCTGGAACCA 650 

CCAAGGCTCA TGACAACTTC TTGGTCTCCA TGTATGTCTT GACTGAGGCT 700 

25 GAAGGTGGTC GTCGTACTGG ATTCGGCGCC AACTACCGTC CTCAAGCTTT 750 

CATC CGTAC T GCCGGTATGT TCCCTTTCAA AGTCAATTAA TGAGCGATTT 800 

GCTAAC GAGT TATAGATGAG GCTGCT AC TC TCAGCTTCCC CGGTGACGAT 850 

CAGTCCAAGC AGGTCATGCC CGGTGACAAC GTTGAGATGA TCTTGAAGAC 900 

ACACCGTCCC GTTGCCGCCG AAGCTGGTCA G 931 

30 



2 ) INFORMATION FOR SEQ ID NO : 506 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 
45 (B) STRAIN: ATCC 18205 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506 

TGGTGCTATT ATTGTTGTTG C TGCTTCTG A TGGTCAAATG CCTCAAACCA 50 

50 GAGAACATTT RTTATTGGCT AGACAAGTTG GTGTTCAACA CATTGTTGTC 100 

TTTGTTAACA AAGTTGATAC TATTGATGAC CCAGAAATGT TGGAATTAGT 150 

TGAAATGGAA ATGAGAGAAT TGTTAAGTAC TTATGGTTTT GATGGTGATA 200 

ACGTCCCAGT TGTTATGGGT TCTGCTTTAT GTGCCTTGGA AGGTCGTGAA 250 

GAAGAAATTG GTGTCAAAGC TATTGATAAA TTATTAGCTG CTGTTGATGA 3 00 

55 ATATATCCCA ACCCCACAAA GAGATTTAGA AAAACCATTC TTGATGGGTG 350 

TTGAAGATGT CTTYTCAATC TCAGGTAGAG GTACCGTTGT TAC TGGTCGT 400 

GTTGAACGTG GTAACTTGAA GAAAGGTGAT GAAGTTGAAA TTGTTGGTTT 450 

AAACAAAACT CCATTGAAAA CTACTGTYAC NGGTATTGAA ATGTTCAAAA 500 

AAGAATTGGA CCAAGCTATG GCTGGTGATA ACTGTGGTAT CTTATTACGT 550 

60 GGTATCAAAA GAGATGACAT YAAAAGAGGT ATGGTTATTG CTAAAACCGG 600 

514 
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TACCATCTCW GCTCACACTA 
AAGAAGAAGG TGGTCGTCAC 
TTATTCATCA GAACTGGTGA 
TGGTGATTCA TCTCAACAAA 
5 GTGAATTGGT TCACCCAACT 



AATTCTTAGC CTCAATGTAT 
TCAGGTTTTG GTGAACATTA 
TGTTACCGTT GTTTTAACCT 
TCTTACCAGG TGACAATGTC 
GCTTTAGAAG CTGGTCAAAG 



ATTTTGACTA 650 

C AG AC C TC AA 7 00 

TYCCAGAAGG 750 

GAAATGGTTT 800 

ATTCAA 846 



10 



2 ) INFORMATION FOR SEQ ID NO: 507 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 964 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

2 0 (A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: WSA-225 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507 

25 GGAGCTGTTG TCGTTGTCGC AGCTTCTGAC GGTCAAATGT AATTGAATGC 50 

CCGCCCAGAC GGATGAAAGG ATTTGACGTT TCTAACATCA GTCTAGGCCT 100 

CAGACCAGAG AACATTTGCT CCTTGCCCGC CAGGTCGGTG TC CAGAAGCT 150 

GGTCGTTTTC GTTAACAAGG TCGATGCCGT TGAGGACCCA GAGATGTTGG 200 

AGCTTGTCGA AC TTG AAATG CGTGAACTCC TCAGCCACTA CAGTTTTGAG 25 0 

3 0 GGTGAGGAGA CCCCCATCAT TTTTGGCTCT GCTCTCTGTG CCCTCGAGTC 3 00 

CCGTCGACCT GAGCTTGGTG TCGAGAAGAT TGACGAGCTA TTGAACGCCG 350 

TCGACACCTG GATCCCCACC CCCGAGCGCG CCACTGATAA GCCTTTCCTC 400 

ATGTCCATTG AGGAAGTGTT CTCTATCTCT GGTCGTGGTA CCGTCGTCTC 450 

CGGTCGTGTT GAGCGTGGTA TCCTCAAGAA GGATTCCGAC GTCGAAATTG 5 00 

3 5 TTGGTGGCTC TACCACCCCT ATCAAGACCA AGGTCACAGA TATCGAAACC 550 

TTCAAGAAGT CCTGCGATGA ATCTCGAGCT GGTGACAACT CTGGTCTCCT 600 

TCTCCGAGGT ATCAAGCGTG AGGACTTGAA GCGTGGAATG GTTGTTGCTG 650 

CCCCCGGATC CACCAAGGCT CACACCGACT TCATGGTCTC CCTCTACGTC 700 

CTGAC TGAGG CTGAGGGTGG TCGTTCCAAC GGCTTCACCC ACAAGTACCG 750 

40 CCCTC AAATG TTCATCCGTA CTGCTGGTAT GTAACCAAAG TTTCCGCTAT 800 

TTAC TAAGT A GATCATTGCT AACTTGTATT CCCTTCCGTA GACGAAGCCG 850. 

CATCTTTCAG CTGGCCTGGA GAAGACCAAG ACAAGAAGGC TATGCCTGGT 900 

GACAACGTCG AGATGATTTG CAAAACCCTC CACCCCATTG CTGCCGAGGC 950 

TGGCCAACGA TTCA 964 

45 



2 ) INFORMATION FOR SEQ ID NO : 508 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yarrowia lipolytica 
60 (B) STRAIN: ATCC 38295 

515 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508 

GGTGCTATCA TTGTCGTTGC TGCTGGAGAC GGTTCCAAGC CCCAGACCCG 50 

5 AGAGCATCTG CTGCTTGCTC GACAGGTCGG TGTCCAGAAC CTGGTTGTGT 100 

TTGTTAACAA GGTTGATCAG ATTGATGATA AGGAGATTCT TGAGCTCGTT 15 0 

GACATGGAGA TGCGAGATCT GCTGACCCAG TACGGTTTTG ATGGTGACAA 200 

CACCCCCGTT GTCATGGGCT CTGCTCTGTG CGCTCTTGAG GGCAAGCAGA 25 0 

AGGATATTGG AGAGGACGCC ATCATGGCCC TTATGGATGC CGTTGATGAG 3 00 

10 CACATCCCTA CCCCTAACCG TGACCTTGAG AAGCCCTTCC TGATGCCCGT 3 50 

TGAGGACGTT TTCTCCATCT CTGGCCGAGG AACTGTTGTT ACTGGCCGAG 400 

TCGAGCGAGG AAACCTGAAG AAGGGTGAGG AAATCGAGAT TGTTGGCTAC 450 

AACAACAAGC CCATCAAGGC TGTTGTTACC GGTATTGAGA TGTTCAAGAA 5 00 

GGAGCTCGAG TCCGCCATGG CCGGTGACAA CGCCGGTATC CTGCTCCGAG 550 

15 GTATCAAGCG AGACGAGATC AAGCGAGGTA TGGTCATGTG CAAGCCTGGC 600 

ACCGTCAACG CCCACACCAA GTTCCTTGCT TCTCTTTACA TCATCCCCAC 650 

CGAGGAGGGT GGTCGAACCA GCTCTTTCGG CGCCAACTAC CGACCCCAGA 7 00 

TGTTCATCCG AACTTCTTCC GTCACCGCCA CTCTCACCTT CCCCGAGGGT 75 0 

ACCGACGAGT CCCAGACCGT CAACCCCGGT GACAACACTG AGATGGTTCT 8 00 

2 0 CGAGCTTGTT CACCCTACCG CCATTGAGGT CAACCAGCGA TTCA 844 



2 ) INFORMATION FOR SEQ ID NO: 509 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1067 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Babesia bigemina 

(B) STRAIN: Suarez-2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509 

40 CTTGGACAAG CTGAAGAGCG AGCGTGAGCG TGGTATCACC ATTGACATTA 50 

CCCTGTGGAA GTTCGAAACT GGCAAGTACT ACTACACCGT CATTGACGCC 100 

CCCGGTCACC GTGACTTCAT TAAGAACATG ATTAC GGGTA CCTCCCAGGC 150 

CGATGTTGCT ATGCTTGTCG TGCCCGCCGA GGCTGGTGGT TTCGAAGCTG 2 00 

CCTTCTCTAA GGAAGGTCAG ACCCGTGAGC ACGCTCTTTT GGCCTTCACC 2 50 

45 CTTGGTGTCA AGCAGATCAT TTGCGCCATC AACAAGATGG ACAAGTGCGA 3 00 

CTACAAGGAG GAC CGTT AC A GCGAAATCCA GAAGGAAGTT CAGGGTTACC 350 

TGAAGAAGGT CGGTTACAAC ATCGAGAAGG TGCCTTTCGT CGCCATCTCC 400 

GGTTTCATGG GTGACAACAT GGTTGAGCGC TCCACCAACA TGCCGTGGTA 45 0 

CAAGGGCAAG ACCTTGGTCG AGGCCCTCGA CATGATGGAG CCCCCGAAGA 500 

5 0 GGCCCGTCGA CAAGCCCCTG CGTCTTCCCC TCCAGGGTGT GTACAAGATC 550 

GGTGGTATCG GTACCGTCCC TGTCGGTCGT GTGGAGACTG GTCAGCTCAA 600 

GGCCGGTATG GTCCTCACCT TCGCCCCCAA CCCGATCACT ACTGAGTGCA 650 

AATCCGTCGA AATGCACCAC GAAGTTATCG ATGTTGCCAG CCCTGGTGAC 700 

AACGTTGGTT TCAACGTGAA GAACGTGTCC ACCTCTGACA TCCGCACTGG 75 0 

55 TCACGTCGCT TCTG AC TCC A AGAACGACCC CGCCAAGGCC GCCGTGTCCT 800 

TCACCGCCCA GGTCATCATC TTGAACCACC CTGGTACCAT CAAGGCCGGT 850 

TACTCCCCTG TGGTTGACTG CCACACTGCC CACATCTCGT GCAAATTCGA 900 

CGAGATCACC AGCCGTATGG ACAAGCGTAC CGGTAAGGCC CTTGAGGAGA 9 50 

ACCCCAAGAC CATCAAGAAC GGCGACGCCG CTATGGTCGT CCTGAAGCCG 1000 

60 TGCAAGCCCA TGGTCGTCGA GGCCTTCACT GAATACGCTC CCCTTGGTCG 105 0 

516 
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TTTCGCCGTG CGTGACG 



5 2 ) INFORMATION FOR SEQ ID NO: 510 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1049 bases 

(B) TYPE: Nucleic acid 
!0 (C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bovis 

(B) STRAIN: Suarez-3 



1067 



20 



25 



30 



35 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510 



G AAGTTC GAG 


50 


ACCGTGACTT 


100 


GCTATGCTTG 


150 


CAAGGAAGGA 


200 


TCAAACAGAT 


250 


GAGGACCGTT 


300 


GGTCGGTTAC 


350 


TGGGAGACAA 


400 


AAGACATTGG 


450 


TGACAAGCCA 


500 


TCGGTACCGT 


550 


ATGATTCTAA 


600 


TGAAATGCAC 


650 


GTTTCAACGT 


700 


GCCTCTGATT 


750 


CCAGGTCATT 


800 


CCGTCGTCGA 


850 


ACCAGCCGTA 


900 


GACCATCAAG 


950 


CCATGGTTGT 


1000 


GTTCGTGAC 


1049 



45 2 ) INFORMATION FOR SEQ ID NO: 511 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1070 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Crithidia fasciculata 

(B) STRAIN : ATCC 11745 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 511 

6 0 

517 
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TGGACAAGCT GAAGGCGGAG CGCGAGCGCG GTATCACGAT CGATATTGCC 50 

CTGTGGAAGT TCGAGTCGCC CAAGTCCGTG TTCACGATCA TCGATGCCCC 100 

CGGCCACCGC GACTTCATCA AGAACATGAT CACCGGCACC TCCCAGGCCG 150 

ATGC CGC CAT TCTGATGATT GACTCGACCC AGGGTGGCTT CGAGGCTGGC 2 00 

5 ATCTCCAAGG ACGGCCAGAC CCGCGAGCAC GCCCTGCTTG CCTTCACGCT 250 

GGGCGTGAAG CAGATGGTTG TGTGCTGCAA CAAGATGGAC GACAAGACGG 300 

TGCAGTACGC CCAGGCCCGC TACGAGGAGA TCAGCAAGGA GGTCGGCGCG 350 

TACCTGAAGC GCGTGGGCTA CAACCCGGAG AAGGTGCGCT TCATCCCGAT 400 

CTCGGGCTGG CAGGGCGACA ACATGATCGA GAAGTCCGAC AACATGTCGT 450 

10 GGTACAAGGG TCCCACGCTG CTGGAGGCGC TCGACCTGCT GGAGGCCCCC 500 

GTGCGTCCGG TGGACAAGCC GCTGCGCCTG CCCCTGCAGG ACGTGTACAA 550 

GATCGGCGGT ATCGGCACTG TGCCCGTGGG CCGTGTGGAG ACCGGCGTGA 600 

TGAAGCCGGG CGACGTTGTG GTGTTCGCGC CTGCCAACGT GACGACCGAG 650 

GTGAAGTCGA TCGAGATG C A CCACGAGCAG CTGGCTGAGG CCGTGCCCGG 700 

15 CGACAACGTG GGCTTCAACG TGAAGAACGT GTC CGTGAAG GATATCCGCC 750 

GTGGTAACGT GTGCGGCAAC ACGAAGAGCG ACCCCCCGAA GGAGGCGGCC 800 

GACTTCACTG CCCAGGTGAT CGTGCTGAAC CACCCCGGCC AGATCAGCAA 850 

CGGCTACGCG CCGGTGCTGG ACTGCCACAC GAGCCACATC GCGTGCAAGT 900 

TCGCGGACAT CGAGTCCAAG ATCGACCGCC GCTCTGGCAA GGAGCTGGAG 950 

2 0 AAGAGCCCGA AGGCCATCAA GTCCGGCGAT GCGGCCATCG TGAAGATGAT 1000 

CCCGC AGAAG CCGATGTGCG TGGAGGTGTT CAACGACTAC CCGCCGCTGG 1050 

GCCGCTTCGC TGTCCGCGAT " 107Q 



25 



35 



2 ) INFORMATION FOR SEQ ID NO: 512 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1052 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
- (D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Entamoeba histolytica 

(B) STRAIN: HM1-IMMS 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512 

GCTGAAAGAG AAAGAGGAAT TACTATTGAT ATTTCATTAT GGAAATTCGA 50 

AACATCTAAA TACTACTTCA CTATTATTGA TGCCCCAGGT CAYAGAGATT 100 

TCATTAAGAA CATGATTACT GGAACTTCAC AAGC TGATGT TGCCATCCTT 150 

45 ATTGTTGCTG CTGGTACTGG WGAATTTGAA GCTGGTATTT CAAAGAATGG 200 

ACAAACCAGA GAACAYATTC TTCTTTCATA CACTCTTGGA G TTAAAC AAA 250 

TGATTGTTGG WGTTAACAAG ATGGATGCTA TTCAATATAA ACAAGAAAGA 3 00 

TATGAAGAAA TTAAGAAAGA AATTAGTGCA TTCCTTAAGA AGACWGGATA 3 50 

TAATCCAGAC AAGATTCCAT TTGTCCCAAT TTCAGGATTC CAAGGAGATA 400 

50 ATATGATTGA ACCATCAACC AAC ATGC CAT GGTACAAAGG ACCAACATTA 450 

ATTGGAGCAC TTGATTCAGT CACACCACCA GAAAGACCAG TTGATAAACC 5 00 

ACTTAG AC TT CCACTTCAAG ATGTTTAYAA GATTTCAGGT ATTGGAACTG 550 

TACCATGTGG AAGAGTTGAA ACTGGAGTTC TTAGACCAGG AACTATTGTT 600 

CAATTTGCAC CATCAGGAGT TTCATCTGAA TGTAAATCAG TTGAAATGCA 65 0 

55 TCACACAGCA CTTGCTCAAG CTATTCCAGG TGATAATGTT GGATTCAATG 700 

TTAGAAAYTT AACAGTTAAA GATATTAAGA GAGGAAATGT AGCATCAGAT 750 

GCTAAGAATC AACCAGCTGT TGGATGTGAA GATTTCACTG CTCAAGTCAT 800 

TGTCATGAAC CATCCAGGAC AAATTAGAAA GGGATATACA CCAGTTCTTG 850 

ATTGCCATAC ATCACACATT GCATGTAAAT TCGAAGAATT ATTAAGCAAG 900 

60 ATTGATAGAA GAACAGGTAA ATCCATGGAA GGAGGAGAAC CAGAATATAT 950 

518 
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TAAGAATGGA GATTCAGCAC TTGTTAAGAT TTTTrraar^ A^nr„o mm 
GTGTTGAAGA ATTTGCTAAA TTCCCACCAT S 



1052 



2 ) INFORMATION FOR SEQ ID NO: 513 

(i) SEQUENCE CHARACTERISTICS • 
10 (A) LENGTH: 1082 bases' 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Giardia lamblia 
2Q (B) STRAIN : Faubert-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513 
o^o^T£ GCG AGCG CGGGAT CACGATCAAC ATCGCGCTCT GGAAGTTPrA 

Hill ilil sgsss ss = 

• ■■■■■ s 

cgaSa-££a? caS£££g§ ??aagSS? SSSSSS S??£ cgt 650 

45 1082 

2) INFORMATION FOR SEQ ID NO: 514 

50 (i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 1098 bases" 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
<°) TOPOLOGY: Linear 

5 5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

fi n i = ! ORGANISM: Leishmania tropica 

60 <B) STRAIN: ATCC 30816 

519 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514 

ACGCGTGGGT GCTCGACAAG CTGAAGGCGG AGCGCGAGCG CGGCATCACG 50 

5 ATCGACATTG CGCTGTGGAA GTTCGAGTCG CCCAAGTCCG TGTTCACGAT 100 

CATCGATGCG CCCGGCCACC GCGACTTCAT CAAGAACATG ATCACGGGCA 15 0 

CGTCGCAGGC GGACGCCGCC ATCCTGATGA TCGACTCGAC GCATGGTGGC 2 00 

TTCGAGGCTG GCATCTCGAA GGACGGCCAG ACCCGCGAGC ACGCGCTGCT 250 

TGCCTTCACT CTTGGCGTGA AGCAGATGGT GGTGTGCTGC AACAAGATGG 3 00 

10 ACGACAAGAC GGTGACGTAC GCGCAGTCGC GCTACGATGA GATCAGCAAG 350 

GAGGTGGGCG CGTACCTGAA GCGCGTGGGC TACAACCCGG AGAAGGTGCG 400 

CTTCATCCCG ATCTCGGGCT GGCAGGGCGA CAACATGATC GAGAAGTCGG 45 0 

ACAACATGCC GTGGTACAAG GGTCCCACGC TGCTGGACGC GCTCGACATG 500 

CTGGAGCCGC CGGTGCGCCC GGTGGACAAG CCGCTGCGCC TGCCCCTGCA 550 

15 GGACGTGTAC AAGATCGGCG GTATCGGGAC GGTGCCCGTG GGGCGCGTGG 600 

AGACCGGCAT CATGAAGCCG GGCGACGTGG TGACGTTCGC GCCCGCCAAC 650 

GTGACGACTG AGGTGAAGTC GATCGAGATG CACCACGAGC AGCTGGCGGA 7 00 

GGCGCAGCCC GGCGACAACG TCGGCTTCAA CGTGAAGAAC GTGTCGGTGA 750 

AGGACATCCG CCGTGGTAAC GTGTGCGGCA ACTCGAAGAA CGACCCGCCG 800 

2 0 AAGGAGGCGG CCGACTTCAC GGCGCAGGTG ATCGTGCTGA ACCACCCCGG 850 

CCAGATCAGC AACGGCTACG CGCCGGTGCT GGACTGCCAC ACGAGCCACA 900 

TTGCGTGCCG CTTCGCGGAA ATCGAGTCCA AGATCGACCG CCGCTCCGGC 950 

AAGGAGCTGG AGAAGAACCC CAAGGCGATC AAGTCTGGCG ATGCCGCGAT 1000 

CGTGAAGATG GTGCCGCAGA AGCCGATGTG CGTGGAGGTG TTCAACGACT 1050 

2 5 ACGCGCCGCT GGGCCGCTTT GCCGTGCGCG ACATGCGCCA AACCGTTG 1098 



30 



2 ) INFORMATION FOR SEQ ID NO : 515 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1104 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
3 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Leishmania aethiopica 

(B) STRAIN: ATCC 50119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515 

45 TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCCGC CATC CTG ATG ATCGACTCGA CGCATGGTGG 2 00 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

50 TTGCCTTCAC TCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 300 

GACGACAAGA CGGTGACGTA CGCGCAGTCG CGCTACGAGG AGATCAGCAA 350 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400 

GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAAGTCG 450 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 50 0 

55 GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGCCGCGTG 600 

GAGACC GGC A TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 65 0 

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 7 00 

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

60 AAGGACATCC GCCGTGGCAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

520 
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GAAGGAGGCG GCCGACTTCA CGGCGCAGGT 

GCCAGATCAG CAACGGCTAC GCGCCGGTGC 

ATTGCGTGCC GCTTCGCGGA AATCGAGTCC 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT 

5 TCGTGAAGAT GGTGCCGCAG AAGCCGATGT 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC 
CGTC 



GATCGTGCTG AACCACCCCG 85 0 

TGGACTGCCA CACGAGCCAC 9 00 

AAGATCGACC GCCGCTCCGG 95 0 

CAAGTCTGGC GATGCCGCGA 1000 

GCGTGGAGGT GTTCAACGAC 1050 

GACATGCGCC AAAC CGTTGC 1100 

1104 



10 



2 ) INFORMATION FOR SEQ ID NO : 516 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1106 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmanla tropica 

(B) STRAIN: ATCC 30815 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

3 0 ACGTCGCAGG CGGACGCCGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC KCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 3 00 

GACGACAAGA CGGTGACGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 350 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400 

3 5 GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAAGTCG 4 50 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGGCGCGTG 600 

GAG AC CGGC A TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

40 CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 7 50 

AAGGACATCC GCCGTGGTAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

45 ATTGCGTGCC GCTTCGCGGA AATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GATGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAAC CGTTGC 1100 

CGTCGG 1106 

50 

2) INFORMATION FOR SEQ ID NO: 517 

55 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1099 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 



521 
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(ii) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania donovani 
5 (B) STRAIN: ATCC 50212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

10 GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCCGC CATCCTGATG ATCGACTCGA CGCATGGTGG 2 00 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC GCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 3 00 

15 GACGACAAGA CCGTGACGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 3 50 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400 

GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAGGTCG 450 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

20 AGGACGTGTA CAAGATCGGC GGTATCGGGA CTGTGCCCGT GGGCCGCGTG 600 

GAGACCGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

CGTGACGACT GAGGTGAAGT CGATCGAGAT GC AC C AC GAG CAGCTGGCGG 700 

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 75 0 

AAGGACATCC GCCGTGGCAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

25 GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 85 0 

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

ATTGCGTGCC GCTTCGCGGA AATCGAGTCC AAGATCG AC C GCCGCTCCGG 950 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GATGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 105 0 

3 0 TACGCGCCGC TGGGCCGCTT TGCCGTGCGC G AC ATGCGC C AAACCGTTG 1099 



2 ) INFORMATION FOR SEQ ID NO : 518 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Leishmania infantum 

(B) STRAIN: MOU 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518 

50 TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 15 0 

ACGTCGCAGG CGGACGCCGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

55 TTGCCTTCAC GCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 3 00 

GACGACAAGA CCGTGACGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 350 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400 

GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAGGTCG 450 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 5 00 

60 GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 55 0 

522 
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AGGACGTGTA CAAGATCGGC GGTATCGGGA 
GAGACCGGCA TCATGAAGCC GGGCGACGTG 
CGTGACGACT GAGGTGAAGT CGATCGAGAT 
AGGCGCAGCC CGGCGACAAC GTCGGCTTCA 
5 AAGGACATCC GCCGTGGCAA CGTGTGCGGC 
GAAGGAGGCG GCCGACTTCA CGGCGCAGGT 
GCCAGATCAG CAACGGCTAC GCGCCGGTGC 
ATTGCGTGCC GCTTCGCGGA AATCGAGTCC 
CAAGGAGCTG GAGAAGAACC CCAAGGCGAT 
10 TCGTGAAGAT GGTGCCGCAG AAGCCGATGT 
TACGCGCCGC TGGGCCGCTT TGCCGTGCGC 



CTGTGCCCGT GGGCCGCGTG 600 

GTGACGTTCG CGCCCGCCAA 650 

GCACCACGAG CAGCTGGCGG 7 00 

ACGTGAAGAA CGTGTCGGTG 750 

AACTCGAAGA ACGACCCGCC 800 

GATCGTGCTG AACCACCCCG 850 

TGGACTGCCA CACGAGCCAC 900 

AAGATCGACC GCCGCTCCGG 95 0 

CAAGTCTGGC GATGCCGCGA 1000 

GCGTGGAGGT GTTCAACGAC 1050 

GACATGCGCC AAACCGTT 109 8 



15 2 ) INFORMATION FOR SEQ ID NO: 519 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1071 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania enriettii 

(B) STRAIN: ATCC 50120 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519 



CTCGAGAAGC— TGAAGGCGGA GCGCGAGCGC GGCATCACGA TCGACATTGC 50 

GCTGTGGAAG TTCGAGTCGC CCAAGTCTGT GTTCACGATC ATCGATGCGC 100 

CCGGCCACCG CGACTTCATC AAGAACATGA TCACCGGCAC GTCGCAGGCC 15 0 

GACGCCGCCA TCCTGATGAT CGACTCGACC CAGGGCGGCT TCGAGGCTGG 2 00 

3 5 CATCTCGAAG GACGGCCAGA CCCGCGAGCA CGCGCTGCTC GCCTTCACGC 250 

TTGGTGTGAA GCAGATGGTG GTGTGCTGCA ACAAGATGGA CGACAAGACG 3 00 

GTGCAGTACT CGCAGGCGCG CTACGAGGAG ATCAGCAAGG AGGTGGGCGC 350 

GTACCTGAAG CGCGTCGGCT ACAACCCGGA GAAGGTGCGC TTCATCCCGA 400 

TCTCGGGCTG GCAGGGCGAC AACATGATCG ACAAGTCGGA CAGCATGCCG 450 

40 TGGTACAAGG GGCCCACGCT GCTGGACGCG CTCGACATGC TGGAGGCGCC 5 00 

GGTGCGCCCG GTGGACAAGC CGCTGCGCCT GCCCCTGCAG GACGTGTACA 550 

AGATCGGCGG TATCGGGACG GTGCCCGTGG GCCGCGTGGA GACCGGCATC 600 

ATGAAGCCTG GCGACGTGGT GACGTTTGCG CCCGCCAACG TGACGACGGA 650 

GGTGAAGTCG ATCGAGATGC ACCACGAGCA GCTGGCGGAG GCGGTGCCCG 7 00 

45 GCGACAACGT CGGCTTCAAC GTGAAGAACG TGTCGGTGAA GGACATCCGC 7 50 

CGTGGTAACG TGTGCGGCAA C TCGAAG AAC GACCCGCCGA AGGAGGCGGC 800 

CGACTTCACG GCGCAGGTGA TCGTGCTGAA CCACCCTGGC CAGATCAGCA 850 

ACGGCTACGC GCCGGTGCTG GACTGCCACA CGAGCCACAT CGCGTGCCGC 900 

TTCGCGGATA TCGAGTCCAA GATCGACCGC CGCTCTGGCA AGGAGCTGGA 950 

50 GAAGAACCCC AAGGCGATCA AGTCCGGCGA TGCGGCCATC GTGAAGATGG 1000 

TGCCGCAGAA GCCGATGTGC GTGGAGGTGT TCAACGACTA CCCGCCGCTG 1050 
GGGCGCTTCG CTGTGCGCGA C ~ ------- u 



55 

2 ) INFORMATION FOR SEQ ID NO: 52 0 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1071 bases 
60 (B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania gerbilli 
<B) STRAIN: ATCC 50121 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 520 

CTCGACAAGC TGAAGGCGGA GCGCGAGCGC GGCATCACGA TCGACATTGC 50 

GCTGTGGAAG TTCGAGTCGC CCAAGTCCGT GTTCACGATC ATCGATGCGC 100 

CCGGCCACCG CGACTTCATC AAGAACATGA TCACGGGCAC GTCGCAGGCG 150 

15 GACGCCGCCA TCCTGATGAT CGACTCGACG CATGGTGGCT TCGAGGCTGG 2 00 

CATCTCGAAG G ACGGC C AG A CCCGCGAGCA CGCGCTGCTT GCCTTCACTC 2 50 

TTGGCGTGAA GCAGATGGTG GTGTGCTGCA ACAAGATGGA CGACAAGACG 3 00 

GTGACGTACG CGCAGTCGCG CTACGATGAG ATCAGCAAGG AGGTGGGCGC 3 50 

GTACCTGAAG CGCGTGGGCT ACAACCCGGA GAAGGTGCGC TTCATCCCGA 4 00 

20 TCTCGGGCTG GCAGGGCGAC AACATGATCG AGAAGTCGGA CAACATGCCG 450 

TGGTACAAGG GTCCCACGCT GCTGGACGCG CTCGACATGC TGGAGCCGCC 500 

GGTGCGCCCG GTGGACAAGC CGCTGCGCCT GCCCCTGCAG GACGTGTACA 55 0 

AGATCGGCGG TATCGGGACG GTGCCCGTGG GCCGCGTGGA GACCGGCATC 600 

ATGAAGCCGG GCGACGTGGT GACGTTCGCG CCCGCCAACG TGACGACTGA 650 

2 5 GGTGAAGTCG ATCGAGATGC ACCACGAGCA GCTGGCGGAG GCGCAGCCCG 7 00 

GCGACAACGT CGGCTTCAAC GTGAAGAACG TGTCGGTGAA GGACATCCGC 7 50 

CGTGGTAACG TGTGCGGCAA CTCGAAGAAC GACCCGCCGA AGGAGGCGGC 800 

CGACTTCACG GCGCAGGTGA TCGTGCTGAA CCACCCCGGC CAGATCAGCA 850 

ACGGCTATGC GCCGGTGCTG GACTGCCACA CGAGCCACAT TGCGTGCCGC 900 

3 0 TTCGCGGAAA TCGAGTCCAA GATCGACCGC CGCTCCGGCA AGGAGCTGGA 950 

. GAAGAACCCC AAGGCGATCA AGTCTGGCGA TGCCGCGATC GTGAAGATGG 1000 
TGCCGCAGAA GCCGATGTGC GTGGAGGTGT TCAACGACTA CGCGCCGCTG 1050 
GGCCGCTTTG CCGTGCGCGA C 1071 

35 

2 ) INFORMATION FOR SEQ ID NO: 521 

( i ) SEQUENCE CHARACTERISTICS : 

4 0 (A) LENGTH: 1114 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania hertigi 

(B) STRAIN : ATCC 50125 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521 



TCGTTCAAGT ACGCGTGGGT GCTCGACAAG CTGAAGGCGG AGCGCGAGCG 50 

CGGTATC AC G ATCGACATTG CGCTGTGGAA GTTCGAGTCG CCCAAGTCGG 100 

55 TGTTCACGAT CATCGATGCG CCCGGCCACC GCGACTTCAT CAAGAACATG 150 

ATC AC CGGC A CGTCGCAGGC GGATGCTGCC ATTCTGATGA TCGATTCGAC 200 

GCAGGGTGGC TTCGAGGCTG GCATCTCGAA GGACGGCCAG ACGCGCGAGC 2 50 

ACGCGCTGCT GGCCTTCACG CTGGGCGTGA AGCAGATGGT TGTGTGCTGC 3 00 

AACAAGATGG ACGACAAGAC GGTGCAGTAC GCGCAGGCGC GCTACGAGGA 3 50 

60 GATCAGCAAG GAGGTGGGCG CGTACCTGAA GCGCGTGGGC TACAACCCGG 400 

524 
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AGAAGGTGCG 
GAGAAGTCCG 
GCTGGACATG 
TGCCCCTGCA 
5 GGCCGTGTGG 
GCCCGCCAAC 
AGCTGCAGGA 
GTGTCGGTGA 
CGACCCGCCG 

10 ACCACCCCGG 
ACCAGCCACA 
CCGCTCCGGC 
ACGCCGCCAT 
TTCAACGACT 

15 AACCGTTGCC 



CTTCATCCCG 
ACAACATGTC 
CTGGAGGCGC 
GGACGTGTAC 
AG ACC GGC AT 
GTGACGACGG 
GGCTG TGCCC 
AGGACATCCG 
AAGGAGGCGG 
CCAGATCAGC 
TCGCGTGCCG 
AAGGAGCTGG 
CGTGAAGATG 
ACCCGCCGCT 
GTCG 



ATCTCGGGCT 
GTGGTACAAG 
CGGTGCGCCC 
AAGATCGGCG 
CATGAAGCCC 
AGGTGAAGTC 
GGCGACAACG 
CCGTGGTAAC 
CTGACTTCAC 
AACGGCTACG 
CTTCGCGGAC 
AGAAGAACCC 
GTGCCGCAGA 
GGGCCGCTTT 



GGCAGGGCGA 
GGTCCCACGC 
GGTGGACAAG 
GCATTGGCAC 
GGCGACGTGG 
GATCGAGATG 
TCGGCTTCAA 
GTGTGTGGCA 
GGCGCAGGTG 
CGCCGGTGCT 
ATCGAGTCGA 
CAAGTCCATC 
AGCCGATGTG 
GCGGTGCGCG 



CAACATGATC 45 0 

TGCTGGAGGC 500 

CCGCTGCGCC 55 0 

GGTGCCGGTG 600 

TGACGTTCGC 650 

CACCACGAGC 7 00 

CGTGAAGAAC 750 

ACTCGAAGAA 800 

ATCGTGCTGA 85 0 

GGACTGCCAC 9 00 

AGATCGACCG 9 5 0 

AAGTCCGGCG 1000 

CGTGGAGGTG 105 0 

ACATGCGCCA 110 0 
1114 
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2) INFORMATION FOR SEQ ID NO : 522 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1106 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
3 0 (A) ORGANISM: Leishmanial major 
IB.)- - -STRAIN :_-ATCC--5-0122 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522 

3 5 TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCCGC CATTCTGATG ATCGACTCGA CGCATGGCGG 200 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 2 50 

40 TTGCCTTCAC TCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 3 00 

GACGACAAGA CGGTGACGTA CGCGCAGTCR CGCTACGATG AGATCAGCAA 3 50 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400 

GCTTCATCCC GATCTCGGGC TGG CAGGGCG ACAACATGAT CGAGAAGTCG 450 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

45 GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 55 0 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGCCGCGTG 600 

GAGAC CGGC A TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 65 0 

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 7 50 

50 AAGGACATCC GCCGTGGTAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGATCAG C AACGGC TAT GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

ATTGC GTGCC GCTTCGCGGA AATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

CAAGGAGCTG GAGAAGAACC C C AAGGC GAT CAAGTCTGGC GATGCCGCGA 1000 

55 TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 105 0 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTTGC 1100 

CGTCGG 11Q6 



60 
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2) INFORMATION FOR SEQ ID NO: 523 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1105 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania amazonensis 

(B) STRAIN: ATCC 50131 

15 (xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 523 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AG TTC GAGTC GCCGAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC C GCGACTTC A TCAAGAACAT GATCACGGGC 150 

2 0 ACGTCGCAGG CGGACGCGGC CATC C TGATG ATCGAC TCGA CGCATGGTGG 2 00 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC TCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 3 00 

GACGACAAGA CGGTGATGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 3 50 

GGAGGTGAGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAG AAGGTG C 400 

2 5 GCTTCATCCC GATCTCGGGG TGGCAGGGCG ACAACATGAT CGACAAGTCG 450 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGCCGCGTG 600 

GAGACCGGGA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 65 0 

3 0 CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 7 00 

AGGCGCAGCC CGGCGACAAC GTCGGC-TTCA ACGTGAAGAA CGTGTCGGTG ' 750 

AAGGACATCC GCCGTGGTAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 9 00 

35 ATCGCGTGCC GCTTCGCGGA GATCGAGTCC AAGATCGACC GCCGCTCCGG 9 50 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT C AAGTC TGGC GACGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAAC CGTTGC 1100 

CGTCG 1105 

40 



2 ) INFORMATION FOR SEQ ID NO: 524 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania mexicana 
55 (B) STRAIN: ATCC 50156 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 
60 GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCGAAGTCC GTGTTCACGA 100 

526 
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TCATCGATGC 
ACGTCGCAGG 
CTTCGAGGCT 
TTGCCTTCAC 
5 GACGACAAGA 
GGAGGTGAGC 
GCTTCATCCC 
GACAACATGC 
GCTGGAGCCG 

10 AGGACGTGTA 
GAGACCGGGA 
CGTGACGACT 
AGGCGCAGCC 
AAGGACATCC 

15 GAAGGAGGCG 
GCCAGATCAG 
ATCGCGTGCC 
CAAGGAGCTG 
TCGTGAAGAT 

20 TACGCGCCGC 



GCCCGGCCAC 
CGGACGCGGC 
GGCATCTCGA 
TCTTGGCGTG 
CGGTGATGTA 
GCGTACCTGA 
GATCTCGGGG 
CGTGGTACAA 
CCGGTGCGCC 
CAAGATCGGC 
TCATGAAGCC 
GAGGTGAAGT 
CGGCGACAAC 
GCCGTGGTAA 
GCCGACTTCA 
CAACGGCTAC 
GCTTCGCGGA 
GAGAAGAACC 
GGTGCCGCAG 
TGGGCCGCTT 



CGCGACTTCA 
CATCCTGATG 
AGGACGGCCA 
AAGCAGATGG 
CGCGCAGTCG 
AGCGCGTGGG 
TGGCAGGGCG 
GGGTCCCACG 
CGGTGGACAA 
GGTATCGGGA 
GGGCGACGTG 
CGATCGAGAT 
GTCGGCTTCA 
CGTGTGCGGC 
CGGCGCAGGT 
GCGCCGGTGC 
GATCGAGTCC 
CCAAGGCGAT 
AAGCCGATGT 
TGCCGTGCGC 



TCAAGAACAT 
ATCGACTCGA 
GACCCGCGAG 
TGGTGTGCTG 
CGCTACGATG 
CTACAACCCG 
ACAACATGAT 
CTGCTGGACG 
GCCGCTGCGC 
CGGTGCCCGT 
GTGACGTTCG 
GCACCACGAG 
ACGTGAAGAA 
AACTCGAAGA 
GATCGTGCTG 
TGGACTGCCA 
AAGATCGACC 
CAAGTCTGGC 
GCGTGGAGGT 
GACATGCGCC 



GATCACGGGC 150 

CGCATGGTGG 2 00 

CACGCGCTGC 250 

CAACAAGATG 3 00 

AGATCAGCAA 350 

GAGAAGGTGC 400 

CGACAAGTCG 450 

CGCTCGACAT 500 

CTGCCCCTGC 55 0 

GGGCCGCGTG 600 

CGCCCGCCAA 650 

CAGCTGGCGG 7 00 

CGTGTCGGTG 750 

ACGACCCGCC 800 

AACCACCCCG 850 

CACGAGCCAC 900 

GCCGCTCCGG 950 

GACGCCGCGA 1000 

GTTCAACGAC 1050 

AAACCGTT 1098 



2 ) INFORMATION FOR SEQ ID NO: 525 



25 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1081 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS : Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Leishmanla tarentolae 

(B) STRAIN: II WT 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525 

40 GCTGAAGGCG GAGCGCGAGC GCGGCATCAC GATCGACATT GCGCTGTGGA 50 

AGTTCGAGTC GCCCAAGTCG GTGTTCACGA TCATCGATGC GCCCGGCCAC 100 

CGCGACTTCA TCAAGAACAT GATCACGGGC ACGTCGCAGG CGGACGCGGC 150 

CATCCTGATG ATCGACTCGA CGCACGGTGG GTTCGAGGCT GGCATCTCGA 2 00 

AGGACGGGCA GACGCGCGAG CACGCGCTGC TTGCCTTCAC TCTTGGCGTG 2 50 

45 AAGCAGATGG TTGTGTGCTG CAACAAGATG GACGACAAGA CGGTGATGTA 3 00 

CGCGCAGTCG CGCTACGATG AGATCAGCAA GGAGGTGGGC GCGTACCTGA 3 50 

AGCGCGTGGG CTACAACCCG GAGAAGGTGC GCTTCATCCC GATCTCGGGC . 400 

TGGCAGGGCG ACAACATGAT CGAGAAGTCG GACAACATGC CGTGGTACAA 450 

GGGTCCCACG CTGCTGGACG CGCTCGACAT GCTGGAGGCG CCGGTGCGCC 500 

50 CGGTGGACAA GCCGCTGCGC CTGCCCCTGC AGGACGTGTA CAAGATCGGC 550 

GGCATCGGCA CGGTGCCCGT GGGCCGCGTG GAGACCGGCA TCATGAAGCC 600 

GGGCGACGTG GTGACGTTCG CGCCCGCGAA CGTGACGACG GAGGTGAAGT 650 

CGATCGAGAT GCACCACGAG CAGCTGGCGG AGGCGCAGCC CGGCGACAAC 7 00 

GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG AAGGACATCC GCCGTGGGAA 7 50 

55 CGTGTGCGGT AACTCGAAGA ACGACCCGCC GAAGGAGGCC GCCGACTTCA 800 

CGGCGCAGGT GATCGTGCTG AACCACCCCG GCCAGATCAG CAACGGCTAC 850 

GCGCCGGTGC TGGACTGCCA CACGAGCCAC ATCGCGTGCC GGTTCGCGGA 900 

CATCGAGTCC AAGATTGACC GCCGCTCCGG CAAGGAGCTG GAGAAGAACC 950 

CCAAGGCGAT CAAGTCCGGC GATGCCGCGA TCGTGAAGAT GGTGCCGCAG 1000 

60 AAGCCGATGT GCGTGGAGAT GTTCAACGAC TACGCGCCGC TTGGCCGCTT 105 0 

527 
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TGCTGTGCGC GACATGCGCC AAACCGTTGC C 1081 



5 2 ) INFORMATION FOR SEQ ID NO: 52 6 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 50129 



20 



45 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 52 6 



AAATACGCGT GGGTGCTCGA CAAGCTGAAG GCGGAGCGCG AGCGCGGCAT 50 

CACGATCGAC ATTGCGCTGT GGAAGTTCGA GTCGCCCAAG TCCGTGTTCA 100 

CGATCATCGA TGCGCCCGGC CACCGCGACT TCATCAAGAA CATGATCACG 150 

GGCACGTCGC AGGCGGACGC CGCCATCCTG ATGATCGACT CGACGCATGG 200 

25 TGGCTTCGAG GCTGGCATCT CGAAGGACGG CCAGACCCGC GAGCACGCGC 250 

TGCTTGCCTT CACKCTTGGC GTGAAGCAGA TGGTGGTGTG CTGCAACAAG 3 00 

ATGGACGACA AGACGGTGAC GTACGCGCAG TCGCGCTACG ATGAGATCAG 3 50 

CAAGGAGGTG GGCGCGTACC TGAAGCGCGT GGGCTACAAC CCGGAGAAGG 400 

TGCGCTTCAT CCCGATCTCG GGCTGGCAGG GCGACAACAT GATCGAGAAG 45 0 

30 TCGGACAACA TGCCGTGGTA CAAGGGTCCC ACGCTGCTGG ACGCGCTCGA 500 

CATGCTGGAG CCGCCGGTGC GCCCGGTGGA CAAGCCGCTG CGCCTGCCCC 550 

TGCAGGACGT GTACAAGATC GGCGGTATCG GGACGGTGCC CGTGGGGCGC 600 

GTGGAGACCG GCATCATGAA GCCGGGCGAC GTGGTGACGT TCGCGCCCGC 65 0 

CAACGTGACG ACTGAGGTGA AGTCGATCGA G ATGC AC C AC GAGCAGCTGG 7 00 

3 5 CGGAGGCGCA GCCCGGCGAC AACGTCGGCT TCAACGTGAA GAACGTGTCG 750 

GTGAAGGACA TCCGCCGTGG TAACGTGTGC GGCAACTCGA AGAACGACCC 800 

GCCGAAGGAG GCGGCCGACT TCACGGCGCA GGTGATCGTG CTGAACCACC 850 

CCGGCCAGAT CAGCAACGGC TACGCGCCGG TGCTGGACTG CCACACGAGC 900 

CACATTGCGT GCCGCTTCGC GGAAATCGAG TCCAAGATCG ACCGCCGCTC 950 

40 CGGCAAGGAG CTGGAGAAGA ACCCCAAGGC GATCAAGTCT GGCGATGCCG 1000 

CGATCGTGAA GATGGTGCCG CAGAAGCCGA TGTGCGTGGA GGTGTTCAAC 1050 

GACTACGCGC CGCTGGGCCG CTTTGCCGTG CGCGACATGC GCCAAACCGT 1100 

TG 1102 



2 ) INFORMATION FOR SEQ ID NO : 527 



( i ) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 1105 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neospora caninum 

(B) STRAIN: Suarez-4 



528 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527 

GGACRAACTT AAAGCTGAAC GTGAGCGTGG TATCACCATT GATATCTCCC 50 

TGTGGAAATT TGAGACCAGC AAGTACTATG TTACCATCAT TGATGCCCCA 100 

5 GGACACAGAG ACTTCATCAA AAACATGATT ACAGGCACAT CCCAGGCTGA 150 

CTGTGCTGTC CTGATTGTTG CTGCTGGTGT TGGTGAATTT GAAGCCGGTA 200 

TCTCCAAGAA CGGGCAGACC CGTGAGCATG CCCTTNTGGC TTACACCCTG 250 

GGTGTGAAAC AACTAATTGT TGGCGTTAAC AAAAKGGATT CCACTGAGCC 300 

ACCCTATAGC CARAAGAGAT ACGARGAAAT TGTTAAGGAA GTCAGCMCCT 350 

10 AYNTTAAAAA AATTGGYTAC AACCCCGACA CAGTANCATT TGKGCCAATT 400 

TNTGGCTGGA ATGGTGACAA CATGCTGGAN CCAAGTGCTA ATATGCCATG 450 

GTTCAAGGGA TGGAAAGTCM CCCGTAAGGA CGGCAATGCC AGKGGAACCM 500 

CCCTGCTTGA AGCTYTGGAT TGCATTYTGC CACCAAYTTG CCCAACTGAC 550 

AAACCCTTGC GTTTGCCTYT CCAGGATGTC TATAAAATTG GKGGTATTGG 600 

15 TACTGTCCCT GTGGGTCGTG TGGAGACTGG TGTTCTCAAA CCTGGCATGG 650 

TGGTCACCTT TGCTCCAGTC AATGTAACAA CTGAAGTGAA GTCTGTAGAA 700 

ATGCACCATG AAGCATTGAG TGAAGCCCTT CCTGGGGACA ATGTGGGCTT 750 

CAATGTCAAG AACGTGTCTG TCAAAGATGT CCGTCGTGGC AATGTGGCTG 800 

GTGACAGCAA AAATGATCCA CCCATGGAAG CTGCTGGCTT CACAGCTCAG 850 

2 0 GTGATTATTT TGAACCATCC AGGC C AAATC AGTGCTGGAT ATGCACCTGT 900 

GCTGGATTGT CACACAGCTC AC ATTGC TTG CAAGTTTGCT GAGCTGAAGG 950 

AGAAGATTGA TCGTCGTTCT GGGAAAAAGC TGGAAGATGG CCCTAAATTC 1000 

TTGAAATCTG GTGACGCTGC CATCGTTGAT ATGGTTCCTG GCAAGCCCAT 1050 

GTGTGTCGAG AGCTTCTCTG ATTATCCTCC CCTGGGCCGT TTTGC TGTGC 1100 

25 GTGAC 1105 



30 



2 ) INFORMATION FOR SEQ ID NO: 52 8 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

3 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

4 0 (A) ORGANISM: Trichomonas vaginalis 

(B) STRAIN: ATCC 3 0001 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528 

45 GCACATCCCA GGCTGATGCT GCTATCCTTG TCATCGACTC CACACTCGGT 50 

GGYTTCGAAG CCGGTATCGC TGAACAAGGC CAGACACGTG AACACGCTCT 100 

TCTTGCCTTC ACACTCGGCA TCAAGCAGGT CATTGTCGCC GTCAACAAGA 150 

TGGATGACAA GACAGTCAAC TACAACAAGG CYCGTTTCGA CG AAATC AC A 200 

GCCGAAATGA CACGCATCCT TACAGGCATC GGCTACAAGC CAGAAATGTT 250 

50 CCGCTTCGTC CCAATCTCCG GCTGGGCTGG CGACAACATG ACAGAGAAGT 3 00 

CTCCAAACAT GCCATGGTAC AATGGC C C AT ACCTTCTTGA AGCCCTCGAT 3 50 

TCCCTTCAGC CACCAAAGCG CCCATTCGAC AAGCCACTCC GTCTTCCACT 400 

CCAGGATGTC TACAAGATCA ACGGTATCGG TACAGTTCCA GTCGGCCGTG 450 

TCGAATCCGG CACAATGAAG CCAGGCATGA TCGTTAACTT CGCCCCATCC 500 

55 ACAGTTACAG CTGAAGTTAA GTCCATCGAA ATGCACCACG AATCCCTTCC 550 

AGAGGCTCTT CCAGGTGACA ACATCGGCTT CAACGTCAAG AACGTTTCCA 600 

CAGCTGATGT CAAGCGTGGC TACGTCGTTG GTGATACAAA GCGTGACCCA 65 0 

CCAGTCGAAT GCGCTTCCTT CACAGCTCAR ATGATCATCT CCAACCACCC 700 

AGGCAAGATC CACGCCGGCT ACCAGCCAGT TTTCG AC TGC CACACAGCTC 750 

60 ACATCGCCTG CAAGTTCGAC AAGCTCATCC AGCGTATCGA TCGTCGCCAC 800 

529 
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GGCAAGAAGG CTACAGAGAA CCCAGAATAC ATTCAGAAGG ATGATGCCGC 850 
TATCGTCGAG GTTGTCCCAT CCAAGCCACT CGTCGTCGAG TCCTTCCAGG 900 
AGTACCCACC ACTCGGCCGT TTCGCCATCC GTGAT 93 5 



2 ) INFORMATION FOR SEQ ID NO : 529 

( i ) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 1065 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei subsp. brucei 

(B) STRAIN: EATR0795 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529 



20 



AAGCTGAAGG CTGAGCGCGA ACGTGGTATC ACGATCGACA TTGCACTGTG 50 

GAAATTCGAG TCACCCAAGT CTGTCTTCAC TATTATTGAT GCTCCTGGGC 100 

25 ACCGTGACTT CATCAAGAAC ATGATCACCG GCACATCGCA AGCCGACGCA 150 

GCCATCCTCA TCATTGCCTC TGCGCAGGGT GAGTTCGAGG CTGGTATCTC 200 

CAAGGATGGA CAGACCCGCG AGCACGCGTT GCTGGCCTTC ACTTTGGGTG 250 

TGAAGCAGAT GGTTGTGTGC TGCAACAAGA TGGACGACAA GACTGTGAAC 300 

TACGGACAGG AGCGGTATGA CGAGATTGTG AAGGAGGTGT CTGCTTACAT 350 

3 0 CAAGAAGGTT GGGTACAACG TGGAGAAGGT GCGCTTCGTC CCCATCTCCG 400 

GATGGCAGGG CGACAACATG ATTGAGAAAT CCGAGAAGAT GCCATGGTAC 450 

AAGGGTCCAA CGCTCCTGGA GGC AC TAG AC ATGCTGGAGC CACCAGTGCG 500 

TCCGAGCGAC AAGCCCCTGC GTCTGCCACT GCAGGACGTG TACAAGATCG 55 0 

GTGGTATTGG CACCGTGCCC GTTGGTC GTG TGGAGACCGG CGTGATGAAG 600 

35 CCTGGTGATG TGGTGACGTT TGCCCCCGCC AACGTGACGA CCGAGGTGAA 65 0 

ATCGATCGAG ATGCACCACG AGCAGCTCGC TGAGGCGACC CCCGGTGACA 700 

ACGTCGGCTT TAACGTGAAG AACGTTTCTG TAAAGGACAT CCGCCGTGGC 750 

AACGTCTGCG GTAACACCAA GAACGACCCC CCAAAGGAGG CCGCCGACTT 800 

CACGGCACAG GTGATCATCC TGAACCACCC CGGACAGATT GGAAACGGTT 850 

40 ATGCGCCCGT GCTGGACTGC CACACATCGC ACATTGCCTG CAAGTTCGCG 900 

GAGATCGAGT CGAAGATCGA CCGTCGCTCT GGCAAGGAGC TGGAGAAGGC 950 

TCCCAAGTCG ATCAAGTCTG GCGACGCCGC GATCGTGCGC ATGGTGCCGC 1000 

AG AAGC CTAT GTGCGTGGAG GTCTTCAACG ACTACGCGCC ACTCGGCCGC 1050 

TTTGCCGTGC GTGAC 1065 

45 



2 ) INFORMATION FOR SEQ ID NO : 530 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1297 bases 

(B) TYPE: Nucleic acid 

(C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Crithidia fasciculata 
60 (B) STRAIN: ATCC 11745 

530 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530 

TTCGCGGAGG GCGTGCCGCC GGTGCTGACG GCGCTGGACG TGACGGAGGA 50 

5 CCTCGGCCGC GACGAGCCGC TGACGCTGGA GATTGTGCAG CACTTGGACG 100 

CGAACACCGG CCGCTGCATT GCCATGCAGA CGACGGATCT GCTGAAGCTG 150 

AAGTCGAAGG TTGTGTCGAC GGGCGGCAAC ATCTCCGTGC CGGTTGGCCG 2 00 

CGAGACGCTG GGCCGCATCT TCAACGTGCT CGGCGACGCG ATCGACCAGC 2 50 

GCGGTGTGGT GGGCGAGAAG ATGCGCATGC CGATCCACGC CGAGGCGCCG 300 

10 AAGC TGGCGG ACCAGGCCGC GGAGGACGCG ATTCTGACGA CCGGCATCAA 3 50 

GGTGATCGAC CTGATTCTGC CGTACTGCAA GGGTGGCAAG ATCGGGCTGT 4 00 

TCGGCGGTGC TGGTGTGGGC AAGACTGTGA TCATCATGGA GCTGATCAAC 4 50 

AACGTGGCCA AGGGCCACGG TGGTTTCTCC GTGTTCGCCG GCGTTGGCGA 5 00 

GCGCACCCGC GAGGGCACGG ATCTGTACCT GGAGATGATG CAGTCGAAGG 550 

15 TCATTGACCT GAAGGGCGAG TCGAAGTGCG TGCTGGTGTA CGGCCAGATG 600 

AACGAGCCCC CGGGTGCGCG TGCGCGTGTT GCGCAGTCTG CGCTGACGAT 650 

GGCGGAGTAC TTCCGTGACG TGGAGGGCCA GAACGTGCTG CTGTTCATCG 7 00 

ACAACATCTT CCGCTTCACC CAGGCCAACT CCGAGGTGTC CGCCCTGCTG 750 

GGCCGCATTC CCGCCGCCGT GGGCTACCAG CCGACGCTTG CCGAGGATCT 8 00 

20 TGGTATGCTG CAGGAGCGCA TTACGTCGAC GACGAAGGGC TCGATTACGT 850 

CTGTGCAGGC CGTGTACGTG CCGGCCGATG ATATCACGGA TCCGGCGCCG 900 

GCGACGACCT TCTCGCACCT GGATGCGACG ACGGTGCTGG ACCGCGCGGT 95 0 

TGCCGAGTCT GGCATCTACC CCGCCGTGAA CCCGCTGGAG TGCGCGTCGC 10 00 

GTATCATGGA CCCCGATGTG ATCGACGTGG ACCACTACAA CGTTGCGCAG 1050 

25 GATATCGTGC AGATGCTGAC CAAGTACAAG GAGCTGCAGG ATATCATTGC 1100 

CGTGCTGGGT ATCGACGAGC TGAGCGAGGA GGACAAGCTT GTGGTGGACC 115 0 

GCGCTCGCAA GGTGACGCGC TTCCTGTCGC AGCCGTTCCA GGTGGCCGAG 12 00 

GTGTTCACCG GCATGACGGG CCACTACGTG CAGCTGGAGG ACACAGTGGA 1250 

GTCGTTCTCT GGCCTGCTGA TGGGCTCGTA CGACCAGATC CCGGAGA 12 97 

30 



2 ) INFORMATION FOR SEQ ID NO: 531 



35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1298 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
<D) TOPOLOGY: Linear 

40 

{ ii ) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 
45 (B) STRAIN: ATCC 30816 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531 



CTTCTCGGAG GGCGTGCCGC CCGTGCTGAC GGCGCTGGAT GTGACGGAGG 50 

50 ACCTTGGCCG CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAC 100 

GCGAACACGG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

GAAGTCGAAG GTCGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 200 

GTGAGACGCT GGGCCGCATC TTCAAYGTTC TGGGCGACGC G ATCG AC C AG 250 

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 300 

55 GAAGCTGGCG GATCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA CCTGATTCTG CCCTACTGCA AGGGTGGCAA GATCGGCCTG 400 

TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 450 

CAACGTCGCG AAGGGCCACG GCGGTTTCTC CGTGTTTGCC GGCGTTGGCG 500 

AGCGCACGCG CGAGGGCACG GACCTGTACC TGGAGATGAT GCAGTCGAAG 550 

60 GTGATTGACC TGAAGGGCGA GTCGAAGTGY GTGCTTGTGT ATGGGCAGAT 600 

531 
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GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 65 0 

TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 7 00 

GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT 7 50 

GGGCCGCATT CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC 800 

5 TTGGTATGCT GCAGGAGCGC ATCACGTCGA CAACGAAGGG GTCGATCACG 85 0 

TCCGTGCAGG CCGTGTACGT GCCAGCGGAT GATATCACGG ATCCCGCGCC 9 00 

CGCGACGACG TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGCGCGG 950 

TGGCGGAGTC GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG 1000 

CGTATCATGG ACCCTGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 

10 GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

CGGTGCTTGG CATCGACGAG CTGAGCGAGG AAGACAAGGT TGTTGTGGAC 1150 

CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 12 00 

GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGTC GACACGGTGG 125 0 

AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT ACGACCAGAT CCCGGAGA 129 8 

15 

2) INFORMATION FOR SEQ ID NO: 532 

2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1297 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmanla aethioplca 
30 (B) STRAIN: ATCC 50119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532 

TTCTCGGAGG GCGTGCCGCC CGTGCTGACG GCGCTGGATG TGACGGAGGA 50 

35 CCTTGGCCGC GATGAGCCGC TGACGCTGGA GATCGTGCAG CACTTGGACG 100 

CGAACACCGG CCGCTGCATT GCGATGCAGA CGACGGACCT GCTGAAGCTG 150 

AAGTCGAAGG TTGTGTCGAC CGGCGGCAAC ATCTCTGTGC CGGTGGGCCG 200 

TGAGACGCTG GGCCGCATCT TCAACGTTCT GGGCGACGCG ATCGACCAGC 250 

GCGGCCCCGT GGGCGAGAAG ATGCGCATGG CGATCCACGC CGAGGCCCCA 3 00 

40 AAGCTGGCGG ATCAGGCCGC GGAGGACACG ATCCTGACGA CCGGCATCAA 350 

GGTGATCGAC CTGATTCTGC CCTACTGCAA GGGTGGCAAG ATCGGCCTGT 400 

TCGGCGGTGC CGGTGTGGGC AAGACTGTGA TCATCATGGA GCTGATCAAC 45 0 

AACGTCGCGA AGGGCCACGG TGGTTTCTCC GTGTTTGCCG GCGTTGGCGA 500 

GCGCACGCGC GAGGGCACGG ACCTGTACCT GGAGATGATG CAGTCGAAGG 550 

45 TGATTGACCT GAAGGGCGAG TCGAAGTGCG TGCTTGTGTA CGGGCAGATG 600 

AACGAGCCCC CGGGTGCGCG CGCGCGCGTT GCGCAGTCTG CGCTGACGAT 650 

GGCGGAGTAC TTCCGCGACG TGGAGGGCCA GAACGTGCTG CTGTTCATCG 700 

ACAACATCTT CCGCTTCACG CAGGCGAACT CCGAGGTGTC TGCGCTGCTG 750 

GGCCGCATTC CAGCCGCCGT GGGCTACCAG CCGACGCTTG CGGAGGATCT 800 

50 TGGTATGCTG CAGGAGCGCA TCACGTCGAC AACGAAGGGG TCGATCACGT 850 

CCGTGCAGGC CGTGTACGTG CCAGCGGATG ATATCACGGA TCCCGCGCCC 900 

GCGACGACGT TCTCGCACCT GGACGCGACG ACTGTGCTGG ACCGCGCGGT 950 

GGCGGAGTCG GGCATCTACC CTGCCGTGAA CCCGCTGGAG TGCGCGTCGC 1000 

GTATCATGGA CCCCGACGTG ATCGATGTGG ACCACTACAA CGTTGCGCAG 1050 

55 GATATCGTGC AGATGCTGAC CAAGTACAAG GAGCTGCAGG ATATCATTGC 1100 

GGTGCTTGGC ATCGACGAGC TGAGCGAGGA AGACAAGGTT GTTGTGGACC 1150 

GCGCGCGCAA GGTGACCCGG TTCCTGTCGC AGCCGTTCCA GGTTGCGGAG 1200 

GTGTTCACGG GCATGACGGG CCACTACGTG CAGCTGGTCG ACACGGTGGA 1250 

GTCGTTCTCT GGCCTGCTGA TGGGGTCGTA CGACCAGATC CCGGAGA 1297 

60 
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2 ) INFORMATION FOR SEQ ID NO: 533 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1298 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

10 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania donovani 
15 (B) STRAIN: ATCC 50212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 533 

CTTCTCGGAG GGCGTGCCGC CCGTACTGAC GGCGCTGGAT GTGACGGAGG 50 

2 0 ACCTTGGCCG CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAT 100 

GCGAACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 15 0 

GAAGTCGAAG GTTGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 2 00 

GTGAGACGCT GGGCCGCATC TTCAACGTTC TGGGCGACGC GATCGACCAG 250 

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 300 

25 GAAGCTGGCG GACCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA CCTCATTCTG CCCTACTGCA AGGGCGGCAA GATCGGCCTG 400 

TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 450 

CAACGTCGCG AAGGGCCACG GTGGCTTCTC CGTGTTTGCC GGCGTTGGCG 500 

AGCGCACGCG CGAGGGCACG GACCTATACC TGGAGATGAT GCAGTCGAAG 550 

30 GTGATTGACC TGAAGGGCGA GTC G AAGTGC GTGCTTGTGT ACGGGCAGAT 600 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 7 00 

GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT 750 

GGGCCGCATT CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCCGAGGATC 800 

35 TTGGTATGCT GCAGGAGCGC ATCACATCGA CGACGAAGGG GTCGATCACG 850 

TCCGTGCAGG CCGTGTACGT GCCGGCGGAT GATATCACGG ATCCCGCGCC 900 

CGCGACGACG TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGCGCGG 950 

TGGCGGAGTC GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG 1000 

CGTATCATGG ACCCCGATGT GATCGATGTG G ACC AC T AC A ACGTTGCGCA 105 0 

40 GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

CGGTGCTTGG CATCGACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 1150 

CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 1200 

GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGCC GACACGGTGG 1250 

AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT ACGACCAGAT CCCGGAGA 1298 

45 



2 ) INFORMATION FOR SEQ ID NO: 534 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1298 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania infantum 
60 (B) STRAIN: MOU 

533 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 534 

CTTCTCGGAG GGCGTGCCGC CCGTACTGAC GGCGCTGGAT GTGACGGAGG 50 

5 ACCTTGGCCG CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAT 100 

GCGAACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

GAAGTCGAAG GTTGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 2 00 

GTGAGACGCT GGGCCGCATC TTCAACGTTC TGGGCGACGC GATCGACCAG 250 

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 3 00 

10 AAAGCTGGCG GACCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA 3 50 

AGGTGATCGA CCTCATTCTG CCCTACTGCA AGGGCGGCAA GATCGGCCTG 4 00 

TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 45 0 

CAACGTCGCG AAGGGCCACG GTGGCTTCTC CGTGTTTGCC GGCGTTGGCG 500 

AGCGCACGCG CGAGGGCACG GACC TATACC TGGAGATGAT GCAGTCGAAG 550 

15 GTGATTGACC TGAAGGGCGA GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT 600 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 7 00 

GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT 750 

GGGCCGCATT CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCCGAGGATC 800 

2 0 TTGGTATGCT GCAGGAGCGC ATCACATCGA CGACGAAGGG GTCGATCACG 850 

TCCGTGCAGG CCGTGTACGT GCCGGCGGAT GATATCACTG ATCCCGCGCC 900 

CGCGACGACG TTCTCGCACC TGGACGC GAC GACTGTGCTG GACCGCGCGG 950 

TGGCGGAGTC GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG 1000 

CGTATCATGG ACCCCGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 

25 GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

CGGTGCTTGG CATCGACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 1150 

CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 1200 

GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGCC GACACGGTGG 1250 

AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT ACG AC C AG AT CCCGGAGA 129 8 

30 



2 ) INFORMATION FOR SEQ ID NO: 535 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 01 bases 

(B) TY£>E: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA- 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania gerhilli 
45 (B) STRAIN: ATCC 50121 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 535 

GCACTTCTCG GAGGGCGTGC CGCCCGTGCT GACGGCGCTG GATGTGACGG 50 

5 0 AGGAC CTTGG CCGCGATGAG CCGCTGACGC TGGAGATCGT GCAGCACTTG 100 

GACGCGAACA CCGGCCGCTG CATTGCGATG CAGACGACGG ACCTGCTGAA 150 

GCTGAAGTCG AAGGTTGTGT CGACCGGTGG CAACATCTCT GTGCCGGTGG 200 

GCCGTGAGAC GCTGGGCCGC ATCTTCAACG TTCTGGGCGA TGCGATCGAC 250 

CAGCGCGGCC CCGTGGGCGA GAAGATGCGC ATGGCGATCC ACGCCGAGGC 300 

55 CCCGAAGCTG GCGGATCAGG CCGCGGAGGA CACGATCCTG ACGACCGGCA 3 50 

TCAAGGTGAT CGACCTGATT CTGCCCTACT GCAAGGGTGG CAAGATCGGY 400 

CTGTTCGGCG GTGCCGGTGT GGGCAAGACT GTGATCATCA TGGAGCTGAT 450 

CAACAACGTC GCGAAGGGCC ACGGTGGTTT CTCCGTGTTT GCCGGCGTTG 500 

GCGAGCGCAC GCGCGAGGGC ACGGACCTGT AC CTGGAGAT GATGCAGTCG 550 

60 AAGGTGATTG ACCTGAAGGG CGAGTCGAAG TGCGTGCTTG TGTACGGGCA 600 
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GATGAACGAG CCCCCGGGTG CGCGCGCGCG CGTTGCGCAG TCTGCGCTGA 650 

CGATGGCGGA GTACTTCCGC GACGTGGAGG GCCAGAACGT GCTGCTGTTC 7 00 

ATCGACAACA TCTTCCGCTT CACGCAGGCG AACTCCGAGG TGTCCGCGCT 7 50 

GCTGGGCCGC ATTCCGGCCG CCGTGGGCTA CCAGCCGACG CTTGCGGAGG 800 

5 ATCTTGGTAT GCTGCAGGAG CGCATCACGT CGACAACGAA GGGGTCGATC 850 

ACGTCCGTGC AGGCCGTGTA CGTGCCAGCG GATGATATCA CGGATCCCGC 900 

GCCCGCGACG ACGTTCTCGC ACCTTGACGC GACGACTGTG CTGGACCGCG 9 50 

CGGTGGCGGA GTCGGGCATC TACCCTGCCG TGAACCCGCT GGAGTGCGCG 1000 

TCGCGTATCA TGGACCCCGA TGTGATCGAT GTGGACCACT ACAACGTTGC 1050 

10 GCAGGATATC GTGCAGATGC TGACCAAGTA CAAGGAGCTG CAGGACATCA 1100 

TTGCGGTGCT TGGCATCGAC GAGCTGAGCG AGGAAGACAA GGTTGTGGTG 1150 

GACCGCGCGC GCAAGGTGAC CCGGTTCCTG TCGCAGCCGT TCCAGGTTGC 12 00 

GGAGGTGTTC ACGGGCATGA CGGGCCACTA CGTGCAGCTG GTCGACACGG 12 5 0 

TGGAGTCGTT CTCTGGCTTG CTGATGGGGT CGTACGACCA GATCCCGGAG 13 00 

15 A 1301 



20 



2 ) INFORMATION FOR SEQ ID NO: 53 6 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1298 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Lelshmanla hertigi 

(B) STRAIN: ATCC 50125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 6 

35 CTTCGCGGAG GGCGTGCCGC CGGTGCTGAC GTCGCTGGAT GTGACGGAGA 50 

ACCTCGGCCG CGATGAGCCG CTGACGCTGG AGATTGTGCA GC AC TTGG AC 100 

GCGAACACCG GTCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

GAAGTCGAAG GTCGTGTCGA CCGGTGGCAA CATCTCTGTG CCTGTTGGCC 2 00 

GCGAGACGCT GGGTCGCATC TTCAACGTGC TTGGCGATGC G ATTGAC C AG 2 50 

40 CGCGGCCCTG TGGGTGAGAA GATGCGCATG GCGATCCACG CCGAGGCGCC 3 00 

GAAGCTGGCG GATCAGGCGG CAGAGGACAC GATCCTGACG ACCGGCATCA 3 50 

AGGTGATCGA TCTTATTCTG CCGTACTGCA AGGGTGGTAA GATCGGTCTG 4 00 

TTCGGTGGTG CCGGTGTAGG CAAGACTGTG ATTATTATGG AGCTGATCAA 450 

TAACGTGGCG AAGGGCCACG GTGGGTTTTC CGTGTTTGCT GGCGTGGGCG 5 00 

45 AGCGCACGCG CGAGGGCACT GACCTGTACC TGGAGATGAT GCAGTCGAAG 55 0 

GTGATTGACC TGAAGGGCGA ATCAAAGTGC GTGCTTGTGT ACGGACAGAT 600 

GAACGAGCCC CCGGGTGCGC GTGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGCCGAGTA CTTCCGCGAT GTGGAGGGCC AGAACGTGCT GCTGTTCATT 700 

GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT 75 0 

50 GGGTCGCATT CCTGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC 800 

TGGGCATGCT GCAGGAGCGC ATTACGTCGA CGACGAAGGG CTCGATTACG 850 

TCTGTGCAGG CCGTGTACGT GCCTGCGGAT GATATCACGG ACCCGGCGCC 9 00 

CGCGACGACG TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGTGCGG 95 0 

TGGCAGAGTC GGGCATTTAC CCTGCGGTGA ACCCGCTGGA GTGCGCGTCG 1000 

55 CGTATCATGG ACCCCGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 105 0 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 110 0 

CCGTGCTTGG TATCGACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 115 0 

CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 12 00 

GGTGTTCACT GGCATGACGG GTCACTACGT TCAGCTGGAG GACACGGTGG 12 5 0 

60 AGTCGTTCTC TGGCCTACTG ATGGGGTCAT ACGACCAGAT CCCGGAGA 129 8 

535 
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2 ) INFORMATION FOR SEQ ID NO: 537 

5 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1297 bases 

(B) TYPE: Nucleic acid 

(C) STRAND EDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Leishmania major 

(B) STRAIN: ATCC 50122 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537 

20 CTTCTCGGAG GGCGTGCCGC CCGTGCTGAC GGCGCTGGAT GTGACGGAGG 50 

ACCTTGGCCG TGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAC 100 

GCGAACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

GAAGTCGAAG GTTGTGTCGA CCGGCGGCAA CATC TCTGTG CCGGTGGGCC 2 00 

GTGAGACGCT GGGCCGCATC TTCAACGTTC TGGGCGATGC GATCGACCAG 250 

25 CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 300 

GAAGCTGGCG GATCAGGCCG CAGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA CTTGATCCTG CCCTACTGCA AGGGTGGCAA GATCGGCCTG 400 

TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 450 

CAATGTCGCG AAGGGCCACG GTGGTTTCTC CGTGTTTGCC GGCGTTGGCG 500 

30 AGCGCACGCG CGAGGGCACG GACCTGTACC TGGAGATGAT GCAGTCGAAG 550 

GTGATTGACC TGAAGGGCGA GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT 600 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 700 

GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CCGCGCTGCT 7 50 

35 GGGCCGCATT CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC 800 

TTGGTATGCT GCAGGAGCGC ATC AC GTCG A CAACGAAGGG GTCGATCACG 850 

TCCGTGCAGG CCGTGTACGT GCCAGCGGAT GATATCACGG ATCCCGCGCC 900 

CGCGACGACG TTCTCGCACC TGGATGCGAC GACTGTGCTG GACCGCGCGG 950 

TGGCGGAGTC GGGCATCTAC CCTGC CGTGA ACCCGCTGGA GTGCGCGTCG 1000 

40 CGTATCATGG ACCCCGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 105 0 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GACATCATTG 1100 

CGGTGCTTGG CATCGACGAG CTGAGCGAGG AAGACAAGGT TGTGGTGGAC 1150 

CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGC GG A 1200 

GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGTC GACACGGTGG 1250 

45 AGTCGTTCTC TGGC CTGCTG ATGGGGTCGT ACGACCAGAT CCCGGAG 1297 



50 



2 ) INFORMATION FOR SEQ ID NO: 53 8 



{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1297 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Leishmania amazonensis 

536 



CA 02307010 2000-05-19 



(B) STRAIN: ATCC 50131 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 53 8 

5 TTCTCGGAGG GCGTGCCGCC CGTGCTGACG GCGCTGGATG TGACGGAGGA 50 

CCTTGGCCGC GATGAGCCGC TGACGCTGGA GATCGTGCAG C AC CTGGACG 100 

CGAACACCGG CCGCTGCATT GCGATGCAGA CGACGGACCT GTTGAAGCTG 150 

AAGTCGAAGG TTGTGTCGAC CGGCGGCAAC ATCTCTGTGC CGGTGGGCCG 200 

TGAGACGCTG GGCCGCATCT TCAACGTGCT GGGCGACGCG ATCGACCAGC 2 50 

10 GCGGCCCCGT GGGTGAGAAG ATGCGCATGG CGATCCACGC CGAGGCCCCG 300 

AAGCTGGCGG ATCAGGCCGC GGAGGACACG ATCCTGACGA CCGGCATCAA 350 

GGTGATCGAC CTGATTC TGC CCTACTGCAA GGGTGGCAAG ATCGGC CTGT 400 

TTGGTGGCGC CGGTGTGGGC AAGACCGTGA TCATCATGGA GTTGATTAAC 450 

AACGTCGCGA AGGGCCACGG TGGTTTCTCG GTGTTTGCCG GCGTTGGCGA 500 

15 GCGCACGCGC GAGGGCACGG ACCTGTACCT GGAGATGATG CAGTCGAAGG 550 

TGATTGACCT GAAGGGCGAG TCGAAGTGCG TGCTTGTGTA CGGGC AG ATG 600 

AACGAGCCCC CGGGTGCGCG CGCGCGCGTT GCGCAGTCTG CGCTGACGAT 650 

GGCGGAGTAC TTCCGAGACG TGGAGGGCCA GAATGTGCTG CTGTTCATCG 7 00 

ACAACATCTT CCGCTTCACG CAGGCGAACT CCGAGGTGTC TGC GCTGCTG 7 50 

20 GGCCGCATTC CGGCCGCCGT GGGCTACCAG CCGACGCTTG CGGAGGATCT 800 

TGGTATGCTG CAGGAGCGCA TCACGTCGAC GACGAAGGGG TCGATCACGT 850 

CCGTGCAGGC CGTGTACGTG CCTGCGGATG ATATCACGGA TCCGGCGCCC 900 

GCGACGACGT TCTCGCACCT GGACGCGACG ACTGTGCTGG ACCGCGCGGT 950 

GGCGGAGTCG GGGATCTACC CTGCCGTGAA CCCGCTGGAG TGCGCGTCGC 1000 

25 GTATCATGGA CCCCGATGTG ATCGACGTGG ACCACTACAA CGTTGCGCAG 1050 

GATATCGTGC AG ATGC TG AC CAAGTACAAG GAGCTGCAGG ATATCATTGC 1100 

GGTGCTTGGT ATCGACGAGC TGAGCGAGGA GGACAAGGTC GTGGTGGACC 1150 

GCGCGCGCAA GGTGACCCGG TTCCTGTCGC AGCCGTTCCA GGTTGCGGAG 12 00 

GTGTTCACGG GCATGACGGG CCACTACGTG CAGCTGGCCG ACACGGTGGA 1250 

3 0 GTCGTTCTCT GGGCTGCTGA TGGGGTCGTA CGACCAGATC CCGGAGA 1297 



35 



2 ) INFORMATION FOR SEQ ID NO : 539 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 539 

45 

CCITACATCC TBGTYGC I CT IAACAAG 27 



50 2 ) INFORMATION FOR SEQ ID NO: 540 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

<ii) MOLECULE TYPE: DNA 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 540 

537 
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15 



20 



55 



GGDGCITCYT CRTCGWAITC CTG 23 



2) INFORMATION FOR SEQ ID NO: 541 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 541 
GTKGAAATGT TCCGCAAGCT GCT 23 

2) INFORMATION FOR SEQ ID NO: 542 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542 
CGGAARTAGA ACTG SGGACG GTAG 24 

35 

2 ) INFORMATION FOR SEQ ID NO: 543 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543 
50 ATCTTAGTAG TTTCTGCTGC TGA 23 



2 ) INFORMATION FOR SEQ ID NO : 544 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
60 (D) TOPOLOGY: Linear 

538 
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5 



25 



35 



40 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 544 
AYGTTGTCGC CMGGCATTMC CAT 23 



10 2 ) INFORMATION FOR SEQ ID NO: 545 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 545 

TACATCCTBG TYGCICTIAA CAAGTG 2 6 



2 ) INFORMATION FOR SEQ ID NO : 546 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

( D ) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 546 
CCRCGICCGG TRATGGTGAA GAT 23 

2 ) INFORMATION FOR SEQ ID NO: 547 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

1 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547 

GTACAGTTGC TTCAGGACGT ATC 23 

2 ) INFORMATION FOR SEQ ID NO : 548 

60 (i) SEQUENCE CHARACTERISTICS: 

539 
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(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

( D ) TOPOLOGY : Linear 

<ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 548 
ACGTTCGATT TCATCACGTT G 



2) INFORMATION FOR SEQ ID NO: 549 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : S i ngl e 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 549 
GAACGTGATA CTGACAAACC TTTA 



2) INFORMATION FOR SEQ ID NO: 550 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550 
GAAGAAGAAC ACCAACGTTG 



2 ) INFORMATION FOR SEQ ID NO : 551 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551 
GAAGAAAAAA TCTTCGAACT GGCTA 



540 
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2 ) INFORMATION FOR SEQ ID NO: 552 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

( D ) TOPOLOGY : Linear 

10 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 552 
TACACGGCCG GTGACTACG 1 9 

15 

2 ) INFORMATION FOR SEQ ID NO: 553 

2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 553 
30 GGCCGTGTTG AACGTGGTCA AATCA 25 



35 



50 



60 



2 ) INFORMATION FOR SEQ ID NO : 554 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 554 

45 

GTTCCTTACA TCGTTGTTTT TCTC 24 



2 ) INFORMATION FOR SEQ ID NO : 555 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

541 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 555 
TCTCGAACTT TCTCTATGTA TGCA 24 

5 

2) INFORMATION FOR SEQ ID NO: 556 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C ) STRANDEDNESS : Single 

(D ) TOPOLOGY : Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 556 
CGGCGCNATC YTSGTTGTTG C 21 



20 



40 



2 ) INFORMATION FOR SEQ ID NO: 557 



25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE ~TYP E : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 557 
3 5 CCMAGGCATR ACCATCTCGG TG 22 



2 ) INFORMATION FOR SEQ ID NO : 558 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 58 
TCITTYAART AYGC I TGGGT ^ u 



55 2 ) INFORMATION FOR SEQ ID NO: 559 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 

542 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559 

CCGACRGCRA YIGTYTGICK CAT 2 3 



10 



20 



25 



2 ) INFORMATION FOR SEQ ID NO: 560 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 560 
GAYTTCATYA ARAAYATGAT YAC 23 

2) INFORMATION FOR SEQ ID NO: 561 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

3 5 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 561 
ACIGTICGGC CRCCCTCACG GAT 2 3 

40 

2 ) INFORMATION FOR SEQ ID NO: 562 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562 
55 CARATGRAYG ARCCICCIGG IGYIMGIATG 3 0 



2 ) INFORMATION FOR SEQ ID NO: 563 

60 
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10 



40 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

<ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 63 
GGYTGRTAIC CIACIGCIGA IGGCAT 26 



15 2) INFORMATION FOR SEQ ID NO: 564 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 564 

TAYGGICARA TG AAYG ARC C ICCIGGIAA 29 

30 

2 ) INFORMATION FOR SEQ ID NO: 565 

(i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 26 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 65 
GGYTGRTAIC CIACIGCIGA IGGDAT 26 

2 ) INFORMATION FOR SEQ ID NO : 566 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 566 
TTYGGIGGIG CIGGIGTIGG IAARAC 26 

60 
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10 



20 



2 ) INFORMATION FOR SEQ ID NO: 567 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Single 

( D ) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 567 
15 TCRTCIGCIG GIACRTAIAY IGCYTG 2 6 



2 ) INFORMATION FOR SEQ ID NO: 568 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 568 

30 

RTIATIGGIG CIGTIRTIGA YGT • 23 



3 5 2 ) INFORMATION FOR SEQ ID NO: 569 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 569 

RTIRTIGGIS CIGTIRTIGA TAT 23 



50 

2 ) INFORMATION FOR SEQ' ID NO: 57 0 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



60 



(ii) MOLECULE TYPE: DNA 

545 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57 0 
RTIRYIGGIC CIGTIRTIGA YGT 23 

5 

2 ) INFORMATION FOR SEQ ID NO: 571 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 571 
RTIRTIGGIC CIGTIRTIGA TGT 23 

20 

2 ) INFORMATION FOR SEQ ID NO: 572 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

( c ) STRANDEDNESS : S ing 1 e 
(D) TOPOLOGY: Linear 



30 



(ii)~T*OLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 572 



3 5 RTIRTIGGIS CIGTIRTIGA 20 



2 ) INFORMATION FOR SEQ ID NO: 573 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 573 

50 

CCICCIACCA TRTARAAIGC 



55 2) INFORMATION FOR SEQ ID NO: 574 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
60 ( C ) STRANDEDNES S : S ingl e 

546 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 574 

ATIGCIATGG AYGGIACIGA RGG 23 



10 



20 



25 



60 



2 ) INFORMATION FOR SEQ ID NO: 57 5 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 575 
TIACCATTTC AGTACCTTCT GGTAA 2 5 

2) INFORMATION FOR SEQ ID NO: 57 6 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 2 6 bases 

' (B) TYPE:* Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 576 
AACTTCRTCA AGAAGGTYGG TTACAA 26 

40 

2 ) INFORMATION FOR SEQ ID NO: 577 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 577 
5 5 C ATGATTGAA CC ATCCACC A 2 0 



2 ) INFORMATION FOR SEQ ID NO: 57 8 

547 
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10 



30 



40 



45 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 578 
CATGATTGAA GCTTCCACCA 2 0 



15 2 } INFORMATION FOR SEQ ID NO: 579 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 579 

GAAGGCCGTG CTGGTGAGAA 2 0 



2 ) INFORMATION FOR SEQ ID NO: 580 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 5 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 580 
GCTAAACCAG CTACAATCAC TCCAC 25 

2 ) INFORMATION FOR SEQ ID NO : 581 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 581 
ACATCGGTGC ATT ATT AC GT GG 22 



548 
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10 



20 



30 



2 ) INFORMATION FOR SEQ ID NO: 582 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 582 
15 TTTCAACTTC GTCGTTGACA CGAACAGT 28 



2 ) INFORMATION FOR SEQ ID NO: 583 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
2 5 (D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 583 
CAACTGCTTT TTGGATATCT TCTTTAATAC CAACG 3 5 



35 2 ) INFORMATION FOR SEQ ID NO: 584 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

4 0 (C) STRANDEDNESS: Single 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 584 

ACATGACACA TCTAAAACAA 2 0 



50 



60 



2 ) INFORMATION FOR SEQ ID NO: 585 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

549 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 585 
AC C AC ATACT GAATTCAAAG 2 0 

5 

2) INFORMATION FOR SEQ ID NO : 586 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

( D ) TOPOLOGY : L inear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 586 
C AGAAGTATA CGTATTATCA 2 0 

20 

2 ) INFORMATION FOR SEQ ID NO: 587 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 587 
35 CGTATTATCA AAAGACGAAG 20 



40 



50 



2 ) INFORMATION FOR SEQ ID NO: 588 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 588 
TCTTCTCAAA CTATCGTCCA 2 0 



55 2 ) INFORMATION FOR SEQ ID NO : 589 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 

550 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 589 

GCACGAAACT TCTAAAACAA 2 0 



10 



20 



25 



60 



2) INFORMATION FOR SEQ ID NO: 590 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

( D ) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 590 
TATACGTATT ATCTAAAGAT 2 0 

2 ) INFORMATION FOR SEQ ID NO : 591 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

3 5 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 591 
TCCTGGTTCT ATTACACCAC 2 0 

40 

2 ) INFORMATION FOR SEQ ID NO: 592 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 592 
55 CAAAGCTGAA GTATACGTAT 20 



2 ) INFORMATION FOR SEQ ID NO: 593 

551 



CA 
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(D) 



cvo ID NO: 594 

~« cpO ID NO: 595 

« (B 1 IS»>DEI«reSS-. Single 

aOLEC.LE TYEE. I» ^ 
SEQ^CB ^CKXPTXOK- SEQ 
CGTATACGTT 

dBO ID NO: 596 
^INFORMATION FOR SBQ 

tA ' mvo-p. Nucleic a{ r lcl l0 
1?' SSiDEPNESS: Single 
$ toXoCT: Linear 

(it) MOLECULE TVPE: ^ 
SEQUENCE BESCK1ETXO- 



40 



45 



50 
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2 ) INFORMATION FOR SEQ ID NO : 597 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

( D ) TOPOLOGY : Linear 

10 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 597 
1 5 ATTGGTATCA AAGAAAC TTC 



20 



2 ) INFORMATION FOR SEQ ID NO : 598 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 598 
AATTACACCT CACACAAAAT * u 



35 2 ) INFORMATION FOR SEQ ID NO: 599 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
4 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 599 

CGGTGAAGAA ATCGAAATCA 



50 



60 



2 ) INFORMATION FOR SEQ ID NO : 600 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

553 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 60 
ATGCAAGAAG AATCAAGCAA 



2 ) INFORMATION FOR SEQ ID NO: 601 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 
GTTTCACGTG ATGATGTACA 



2 ) INFORMATION FOR SEQ ID NO: 602 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 
AAGTTGAAGT TGTTGGTATT 



2 ) INFORMATION FOR SEQ ID NO: 603 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 
GGTATTAAAG ACGAAACATC 



2 ) INFORMATION FOR SEQ ID NO : 604 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

554 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 604 

GGTGATGAAG TAGAAATCGT 20 



10 



20 



25 



50 



2 ) INFORMATION FOR SEQ ID NO: 605 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 605 
GAAATGTTCC GTAAATTATT 2 0 

2 ) INFORMATION FOR SEQ ID NO: 606 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 20 bases 

(B) "TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

3 5 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 606 
ATTAGACTAC GCTGAAGCTG 2 0 

40 

2 ) INFORMATION FOR SEQ ID NO: 607 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: JEnte.ro coccus faecalls 
55 (B) STRAIN: ATCC 29212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 607 

CGGAGCTATC TTAGTAGTTT CTGCTGCTGA TGGTCCTATG CCTCAAACAC 50 
60 GTGAACATAT CTTATTATCA CGTAACGTTG GTGTACCATA CATCGTTGTA 100 

555 
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TTCTTAAACA AAATGGATAT GGTTGATGAC GAAGAATTAT TAGAATTAGT 15 0 

AGAAATGGAA GTTC GTG ACT TATTATCAGA ATACGATTTC CCAGGCGATG 2 00 

ATGTTC CAGT TATCGCAGGT TCTGCTTTGA AAGCTTTAGA AGGCGACGAG 250 

TCTTATGAAG AAAAAATCTT AGAATTAATG GCTGCAGTTG ACGAATATAT 300 

5 CCCAACTCCA G AAC GTG AT A CTGACAAACC ATTCATGATG CCAGTCGAAG 3 50 

ACGTATTCTC AATCACTGGA CGTGGTACTG TTGCTACAGG ACGTGTTGAA 40 0 

CGTGGTGAAG TTCGCGTTGG TGACGAAGTT GAAATCGTTG GTATTAAAGA 45 0 

CGAAACATCT AAAACAACTG TTACAGGTGT TGAAATGTTC CGTAAATTAT 500 

TAGACTACGC TGAAGCAGGC GACAACTTCG GTGCTTTATT ACGTGGTGTA 55 0 

10 GCACGTGAAG ATATCGAACG TGGACAAGTA TTAGCTAAAC CAGCTACAAT 600 

CACTCCACAC ACAAAATTCA AAGCTGAAGT ATACGTATTA TCAAAAGAAG 65 0 

AAGGCGGACG TCACACTCCA TTCTTCACTA ACTACCGTCC TCAATTCTAC 7 00 

TTCCGTACAA CAGACGTTAC TGGTGTTGTA GAATTGCCAG AAGGTAC TGA 7 50 

AATGGTAATG CCTGGTGATA ACGTTGCTAT GGACGTTGAA TTAATTCACC 800 

15 CAATCGCTAT CGAAGACGGA A 821 



20 



2) INFORMATION FOR SEQ ID NO: 608 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Enterococcus faecixim 

(B) STRAIN: ATCC 19434 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 608 

3 5 CGGAGCTATC TTGGTAGTTT CTGCTGCTGA CGGCCCAATG CCTCAAACTC 50 

GTGAACACAT CCTATTGTCT CGTCAAGTTG GTGTTCCTTA CATCGTTGTA 100 

TTCTTGAACA AAGTAGACAT GGTTGATGAC GAAGAATTAC TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACC TATTAACAGA ATACRAATTC CCTGGTGRCG 2 00 

ATGTTC CTGT AGTTGC TGG A TCAGCTTTGA AAGCTC TAG A AGGCGACGCT 2 50 

40 TCATACGAAG AAAAAATTCT TGAATTAATG GCTGCAGTTG ACGAATACAT 3 00 

CCCAACTCCA G AAC GTG AC A AC GAC AAACC ATTCATGATG CCAGTTGAAG 350 

ACGTGTTCTC AATTACTGGA CGTGGTACTG TTGCTACAGG TCGTGTTGAA 400 

CGTGGACAAG TTCGCGTTGG TGACGAAGTT GAAGTTGTTG GTATTGCTGA 450 

AG AAAC TTC A AAAACAACAG TTACTGGTGT TGAAATGTTC CGTAAATTGT 500 

45 TAG AC Y AC GC TGAAGCTGGA GACRACATTG GTGCTTTACT ACGTGGTGTT 5 50 

GCACGTGAAG ACATCCAACG TGGACAAGTT TTAGCTAAAC CAGGTACAAT 600 

CACACCTCRT ACAAAATTCT CTGCAGAAGT ATACGTGTTG ACAAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCACTA ACTACCGTCC AC AATTC T AC 7 00 

TTCCGTACAA CTGACGTAAC AGGTGTTGTT GAATTACCAG AAGGAACTGA 7 50 

50 A 751 



2 ) INFORMATION FOR SEQ ID NO: 609 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 

556 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: ATCC 49573 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 609 

10 CGGTGCGATC TTAGTAGTAT CTGCTGCTGA CGGTCCTATG CCTCAAACTC 5 0 

GTGAACACAT CTTGTTATCA CGTAACGTTG GCGWACCATA CATC GTTGTT 100 

TTCTTGAACA AAATGGATAT GGTTGAYGAC GAAGAATTGC TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACC TATTGTCTGA ATATGACTTC CCAGGCGACG 2 00 

ATGTTC CTGT AATCGCCGGT TCTGCTTTGA AAGCTCTTGA AGGAGATCCT 2 50 

15 TCATACGAAG AAAAAATCAT GGAATTGATG GCTGCAGTTG ACGAATACGT 3 00 

TCCAACTCCA GAACGTGATA CTGACAAACC ATTCATGATG CCAGTCGAAG 350 

ACGTATTCTC AATCACTGGA CGTGGTACTG TTGCTACAGG CCGTGTTGAA 4 00 

CGTGGACAAG TTCGCGTTGG TGATGAAGTA GAAATCGTTG GTATTGCTGA 45 0 

CGAAACTGCT AAAACAACTG TAACAGGTGT TGAAATGTTC C GT AAATTGT 5 00 

2 0 TAGACTATGC TGAAGCAGGG GATAACATTG GTGCATTGCT ACGTGGGGTT 55 0 

GCTCGTGAAG ACATCCAACG TGGACAAGTA TTGGCTAAAG CTGGTACRAT 6 00 

CACACCTCAT ACAAAATTCA AAGCTGAAGT TTATGTTTTG ACAAAAGAAG 65 0 

AAGGTGGRCG TCACACTCCA TTCTTCACTA ACTACCGTCC TCAGTTC TAC 7 00 

TTCCGTACAA CTGACGTAAC TGGTGTTGTT GAATTAC C AG AAGGAACTGA 7 50 

25 A 751 



30 



2 ) INFORMATION FOR SEQ ID NO: 610 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNE S S : Doub 1 e 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Haemophilus influenzae Rd 

(B) STRAIN: KW20 

(C) ACCESSION NUMBER: extracted from U32739 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 610 



AATATGATTA CTGGTGCGGC ACAAATGGAT GGTGCTATTT TAGTAGTAGC 50 

AGCAACAGAT GGTCCTATGC CACAAACTCG TGAACACATC TTATTAGGTC 100 

GCCAAGTAGG TGTTCCATAC ATCATCGTAT TCTTAAACAA ATGCGACATG 150 

GTAGATGACG AAGAGTTATT AGAATTAGTC GAAATGGAAG TTCGTGAACT 200 

50 TCTATCTCAA TATGACTTCC CAGGTGACGA TACACCAATC GTACGTGGTT 250 

CAGCATTACA AGCGTTAAAC GGCGTAGCAG AATGGGAAGA AAAAATCCTT 300 

GAGTTAGCAA ACCACTTAGA TACTTACATC CCAGAACCAG AACGTGCGAT 350 

TGACCAACCG TTCCTTCTTC CAATCGAAGA TGTGTTCTCA ATCTCAGGTC 400 

GTGGTACTGT AGTAACAGGT CGTGTAGAAC GAGGTATTAT CCGTACAGGT 450 

55 GATGAAGTAG AAATCGTCGG TATCAAAGAT ACAGCGAAAA CTACTGTAAC 500 

GGGTGTTGAA ATGTTCCGTA AATTACTTGA CGAAGGTCGT GCAGGTGAAA 55 0 

ACATCGGTGC ATT ATT AC GT GGTACCAAAC GTGAAGAAAT CGAACGTGGT 600 

CAAGTATTAG CGAAACCAGG TTCAATCACA CCACACACTG ACTTCGAATC 650 

AGAAGTGTAC GTATTATCAA AAGATGAAGG TGGTCGTCAT ACTCCATTCT 700 

60 TCAAAGGTTA CCGTCCACAA TTCTATTTCC GTACAACAGA CGTGACTGGT 75 0 

557 
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ACAATCGAAT TACCAGAAGG CGTGGAAATG GTAATGCCAG GCGATAACAT 800 
CAAGATGACA GTAAGCTTAA TCCACCCAAT TGCGATGGAT CAAGGTTTAC 850 
GTTTCGCAAT CCGTGAAGGT GGCCGTACAG TAGGTGCAGG C 891 



2 ) INFORMATION FOR SEQ ID NO: 611 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: t , 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 14990 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 611 



CGGCGGTATC TTAGTTGTAT CTGCTGCTGA CGGTCCAATG CCACAAACTC 50 

GTGAACACAT C TTATTATC A CGTAACGTTG GTGTACCAGC ATTAGTTGTA 100 

25 TTCTTAAACA AAGTTGACAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGACG 200 

ATGTACCTGT AATCGCTGGT TCTGCATTAA AAGCATTAGA AGGCGATGCT 250 

GAATACGAAC AAAAAATCTT AGACTTAATG CAAGCAGTTG ATGATTACAT 300 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

30 ACGTATTCTC AATC AC TGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 400 

CGTGGTC AAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTATGCACGA 450 

AAC TTC T AAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAATTATTAG 500 

ACTACGC TG A AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCA 550 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTC T ATT AC 600 

35 ACCACACACA AAATTCAAAG CTGAAGTATA CGTATTATCT AAAGATGAAG 650 

GTGGACGTCA CACTCCATTC TTCACTAACT ATCGCCCACA ATTCTATTTC 7 00 

CGTACTACTG ACGTAAC TGG TGTTGTAAAC TTACCAGAAG GTACAGAAAT 7 50 

GGTTATGCCT GGCGACAACG TTGAAATGAC AGTTGAATTA ATCGCTCCAA 800 

TCGC TATCG A AGACGGAA 818 

40 



2) INFORMATION FOR SEQ ID NO: 612 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
55 serotype Parathyphi A 

(B) STRAIN: ATCC 9150 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 612 

60 GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 50 

558 
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TGAGCACATC 
TCCTGAACAA 
GAAATGGAAG 
CACGCCGATC 
5 AGTGGGAAGC 
CCGGAACCAG 
CGTATTCTCC 
GCGGTATCAT 
ACTCAGAAGT 

10 CGAAGGCCGT 
GTGAAGAAAT 
CCGCACACCA 
CGGCCGTCAT 
GTACTACTGA 

15 GTAATGCCGG 
CGCAATGGAC 



CTGCTGGGTC 
ATGCGACATG 
TTCGTGAACT 
GTTCGTGGTT 
GAAAATCATC 
AGCGTGCGAT 
ATCTCCGGTC 
CAAAGTGGGC 
CTACCTGTAC 
GCCGGTGAGA 
CGAACGTGGT 
AGTTCGAATC 
ACTCCGTTCT 
CGTGACTGGC 
GCGACAACAT 
GACGGTCTGC 



GTCAGGTAGG 
GTTGATGACG 
TCTGTCTCAG 
CTGCTCTGAA 
GAACTGGCTG 
TGACAAGCCG 
GTGGTACCGT 
GAAGAAGTTG 
TGGCGTTGAA 
ACGTAGGTGT 
CAGGTACTGG 
TGAAGTGTAC 
TCAAAGGCTA 
ACCATCGAAC 
CAAAATGGTT 
GTTTC 



CGTTCCGTAC 
AAGAGCTGCT 
TACGACTTCC 
AGCGCTGGAA 
GCTTCCTGGA 
TTCCTGCTGC 
TGTTACCGGT 
AAATCGTTGG 
ATGTTCCGCA 
TCTGCTGCGT 
CTAAGCCGGG 
ATTCTGTCCA 
CCGTCCGCAG 
TGCCGGAAGG 
GTTACCCTGA 



ATCATCGTGT 100 

GGAACTGGTT 150 

CGGGCGACGA 2 00 

GGCGACGCAG 2 5 0 

TTCTTACATC 3 00 

CGATCGAAGA 3 50 

CGTGTAGAAC 400 

TATCAAAGAG 450 

AACTGCTGGA 5 00 

GGTATCAAAC 5 50 

CACCATCAAG 600 

AAGATGAAGG 650 

TTCTACTTCC 7 00 

CGTAGAGATG 75 0 

TCCACCCGAT 800 
825 



20 2 ) INFORMATION FOR SEQ ID NO: 613 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia ficaria 

(B) STRAIN: ATCC 33105 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 613 



GGCGCTATCC TGGTTGTTGC TGCGACTGAC GGCCCAATGC CTCAGACCCG 5 0 

TGAGCACATC CTGCTGGGTC GYCAGGTTGG CGTTCCTTTC ATCATCGTRT 100 

TCATGAACAA ATGCGACATG GTTGATGATG AAGAGCTGCT GGAACTGGTA 15 0 

GAAATGGAAG TTCGCGAACT GCTGTCCGCT TACGACTTCC CTGGCGATGA 200 

40 CCTGCCGGTG ATTCGCGGTT CCGCGCTGAA AGCGCTGGAA GGCGAAGCCG 25 0 

AGTGGGAAGC TAAAATCATC GAGCTGGCTG AMCMSCTGGA TACTTACATC 300 

CCAGAACCAG AGCGCGCTAT CGACAAGCCG TTCCTGCTGC CAATCGAAGA 350 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT GGTTACCGGT CGTGTTGAGC 400 

GCGGTATCAT CAAAGTTGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAC 450 

45 ACCGTCAAGT CTACCTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCCGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAGC 55 0 

GTGAAGACAT CGAACGTGGT CAGGTTCTGG CTAAACCAGG TTCCATCAAG 600 

CCGCACACCC AGTTCGATTC AGAAGTGTAC ATCCTGAGCA AAGAAGAAGG 650 

TGGTCGTCAC ACKCCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 7 00 

50 GTACTACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTTGAGATG 750 

GTAATGCCTG GCGACAACGT GAACATGA 778 



55 2 ) INFORMATION FOR SEQ ID NO: 614 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 653 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

559 
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10 



(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus malodoratus 

(B) STRAIN: ATCC 43197 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 614 



GTGCGATCTT AGTAGTATCA GCTACTGATG GTCCAATGCC TCAAACTCGT 50 

GAACACATTT TGTTATCACG TCAAGTTGGT GTTAAGCACT TGATCGTTTT 100 

CTTGAACAAA GTAGATTTAG TTGATGACGA AGAATTGATC GACTTAGTTG 150 

AAATGGAAGT AC GTG AATT A CTTTCTGAAT ATGGTTTCCC AGGTGATGAT 2 00 

15 ATTCCAGTGC TTAAAGGTTC TGC TTTG AAA GCATTAGAAG GCGATCCAGA 250 

ACAAGAACAA GTTATTCTTG ATTTGATGGA TACCGTTGAT GAATATATCC 300 

CAACACCTGA ACGTGACAAT GACAAACCGT TCTTGTTACC AGTTGAGGAT 3 50 

GTTTTCTCGA TCACAGGACG TGGTACTGTA GCTTCTGGTC GTATCGACCG 400 

TGGCGAAGTT AAAGTCGGCG ATGAAATTGA AATCATCGGG ATCAAACCTG 450 

20 AAGTTCAAAA AGC AATCGTT ACTGGACTTG AAATGTTCCG TAAAACATTG 500 

GATTATGGTG AAGCTGGCGA TAACGTTGGG GTTC T ATT AC GTGGGATTAC 550 

ACGTGATGAA ATCGAACGTG GCCAAGTATT AGCTAAACCA GGTTCAATCA 600 

CACCACATAC TAAGTTCAAA GCCGAAGTAT ATGTGTTGAC GAAAGAAGAA 650 

GGT 653 

25 



2 ) INFORMATION FOR SEQ ID NO: 615 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus durans 
40 (B) STRAIN: ATCC 19432 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 615 

CCATTCTAGT TGTATCTGCA ACAGATGGAC CAATGCCACA AAC AC GTGAA 50 

45 CATATTTTAT TGTCACGTCA AGTAGGTGTT AAATATTTGA TCGTCTTCTT 100 

GAACAAAATC GACTTAGTAG ATGATGAAGA ATTGATTGAT CTTGTCGAAA 150 

TGGAAGTTCG TGAATTATTA AGCGAATATG GTTTCCCAGG TGACGATACA 200 

CCAGTCATCA AAGGTTCAGC ATTAAAAGCT TTACAAGGAG ATCCTGATGC 2 50 

AGAAGCAGCT ATCATGGAAT TGATGGATAC TGTTGATGAA TATATCCCAA 3 00 

5 0 CACCAGAACG TGATACAGAC AAACCATTAT TGTTACCAGT GGAAGATGTC 3 50 

TTCTCAATCA CAGGTCGTGG GACTGTTGCT TCAGGTCGTA TCGATCGTGG 400 

TGCAGTTCGT GTAGGTGATG AAATCGAAAT CGTCGGTATC AAACCTGAAA 450 

CACAAAAAGC TGTTGTAACT GGGGTCGAAA TGTTCCGCAA GACATTAGAC 500 

TATGGTGAAG CAGGAGATAA CGTTGGGGTA TTGTTACGTG GT ATCC AAC G 550 

55 TGAAGATATC G AAC GTGG AC AAGTAATCGC AAAACCAGGT TCAATCACAC 600 

CACATACAAA ATTCAAAGCA GAAGTGTACG TATTGACAAA AGAAGAAGGT 650 

GGACGTCATA CACCATTCTT CAATAACTAT CGTCCACAAT TCTACTTCCG 7 00 

TACAACTGAC GTAACTGGAA CAATCGTTTT AC C TGGAGGC ACTGAAATGG 750 

TTATGCCTGG AGATAACGTA ACGATCGACG TTGAATTGAT CCATCCAGTT 800 

60 GCCATCGAAA ACGGAACAAC TTTCTCTAT 829 

560 
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2 ) INFORMATION FOR SEQ ID NO : 616 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 669 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Enterococcus pseudoavium 

(B) STRAIN: ATCC 49372 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 616 

20 GGTGCAATTT TAGTAGTATC TGCTACTGAT GGCCCAATGC C AC AAAC AC G 50 

TGAACATATC TTGTTATCAC GTCAAGTAGG GGTTAAACAC TTAATCGTCT 100 

TCTTGAACAA AGTTGATTTA GTTGATGATG AAGAATTGAT CGATTTAGTT 150 

GAAATGGAAG TTCGGGAATT GCTTTCTGAA TATGGTTTCC CAGGCGATGA 2 00 

TATTCCAGTA CTTAAAGGTT CTGCTTTGAA AGCTTTAGAA GGCGATCCTG 250 

2 5 AACAAGAACA AGTAATCCTT GACTTGATGG ATACGGTTGA TGAATACATC 3 00 

CCAACGCCTG AACGTGATAC TGACAAACCA TTCTTGTTAC CAGTCGAAGA 35 0 

TGTCTTCTCA ATCACAGGAC GTGGTACGGT TGCATCTGGT CGTATCGATC 400 

GTGGGGAAGT TAAAGTCGGT GATGAAGTTG AAATCATCGG GATCAAACCT 450 

GAAGTGCAAA AAGCTGTCGT AACTGGACTA GAAATGTTCC GTAAGACATT 500 

3 0 GGATTACGGT GAAGCTGGCG ATAACGTTGG GGTTCTATTA CGTGGGATTA 550 

CTCGTGATGA . AATCGAACGT GGACAAGTAT TAGCTAAACC AGGTTCAATC 600 
ACTCCACATA CGAAATTCAG TGCAGAAGTT TATGTATTGA CGAAAGAAGA 65 0 

AGGTGGCCGT CATACGCCA 669 



35 



50 



2) INFORMATION FOR SEQ ID NO: 617 



(i) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus dlspar 

(B) STRAIN: ATCC 51266 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 617 



CGGGGCAATT TTAGTTGTAT CTGCAACTGA TGGCCCAATG CCACAAACAC . 50 

GTGAACACAT TTTGTTAGCT CGTCAAGTAG GGGTTAAATA TTTAATCGTC 100 

55 TTCTTGAACA AAACAGATTT AGTTGATGAT GAAGAATTAT TGGAACTAGT 150 

TGAAATGGAA GTTCGTGAAT TATTAAATGA ATACAATTTC CCTGGCGATG 2 00 

ATATTCCTGT TATTCGCGGA TCTGCTTTAA AAGCATTAGA AGGCGATCCA 25 0 

GAACAAGAAG AAGTAATTAT GAACTTGATG GATACTGTGG ATGAATATAT 3 00 

CCCAACTCCA GAACGTGACA ATGATAAACC ATTCTTGTTA CCAGTGGAAG 3 50 

60 ATGTCTTCAC AATTACTGGT CGTGGTACTG TTGCTTCAGG TCGTATCGAC 400 

561 
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CGTGGTAAAG TCAACGTTGG TGATGAAATT 
AGAAACACAA AAAGCTGTTG TAACCGGTTT 
TGGATTATGG TGAAGCTGGT GATAACGTTG 
ACTCGTGATG AAGTAGAACG TGGTCAAGTA 
5 T AC AC CG CAT AC C AAATTT A AAGGTGAAGT 
AAGGTGG AC G TCATACTCCT TTCTTTAATA 
TTCCGTACAA C TG ATGTG AC TGGTAACATC 
AATGGTAATG C C TGGTG AT A ATGTAACAAT 
CAATCGCCGT TGAAAAAGGG ACT ACTTTC T 



GAAATTATCG G AATTAAAC C 450 

GGAAATGTTC CGTAAAACTT 500 

GGGTCTTATT ACGTGGGATT 5 50 

TTAGCAAAAC CAGGTTCCAT 600 

TTATATCTTA ACAAAAGAAG 650 

ACTATCGTCC TCAATTTTAT 7 00 

GCATTACCTG AAGGAACTGA 7 50 

TGAAGTTGAA TTGATTCATC 800 

CAATT 835 



2 ) INFORMATION FOR SEQ ID NO : 618 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 673 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus avium 
25 (B) STRAIN: ATCC 14025 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 618 

GGTGCAATCC TAGTAGTATC AGCTACTGAT GGTCCAATGC CGCAAACACG 50 

30 TGAACATATT TTGCTATCAC GGCAAGTGGG TGTTAAACAC TTAATCGTAT 100 

TTTTAAACAA AGTTGATTTA GTCGATGATG AAGAATTGAT CGATCTAGTT 150 

GAAATGGAAG TCCGTGAATT ACTTTCTGAA TATGGTTTCC CAGGTGACGA 200 

TATTCCAGTT CTCAAAGGTT CAGCTTTGAA AGCATTAGAA GGCGATCCTG 2 50 

AACAAGAACA AGTAATCCTT GATTTAATGG ATACAGTTGA CGAATATATC 3 00 

3 5 CCAACTCCAG AAC GTG AC AC TGACAAGCCA TTCTTGTTAC CAGTCGAAGA 3 50 

TGTATTTTCT ATCACTGGTC GTGGG AC TGT AGCGTCTGGA CGGATTGATC 400 

GTGGTGAAGT TAAAGTCGGC GATGAAGTTG AAATCATCGG GATCAAACCT 450 

GAAATTCAAA AAGCAGTCGT AACTGGACTT GAAATGTTCC GTAAAACTTT 500 

AG ATTATGG T GAAGCTGGCG AT AAC GTTGG GGTTC T ATT A CGTGGGATTA 5 50 

40 CACGTGATGA AATCGAACGT GGTCAAGTCT TAGCTAAACC AGGTTCAATC 600 

ACACCACATA CAAAATTCAG TGCAGAAGTT TACGTATTGA CGAAAGAAGA 650 

AGGTGGACGT CATACACCAT CTT 673 



45 



55 



2 ) INFORMATION FOR SEQ ID NO: 619 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1713 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sac char omyces cerevlsiae 
(C) ACCESSION NUMBER: K00428 



60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 619 

562 
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TTTCATATTT TTAAGGATTT TGTTTTAGCA CCCATCCGAC CTCAGTCAAT 
ATATCCTTTC GCGACCAGGC TTTCCTCCCT TTTGCTGCTA ACTGGTTACA 
GATTTTCCTA TTTTTGGTCA TTTTTATCTT TGAAACTGAT TAAGC TGAAA 
5 AAATTTGAGC TTCTTTGTTG TAAACTATTT TGTGCTTTCA GTTTTATTCT 
AGCTCGACAA AGGTAACAGA CAAAAATGTC AGCTTTATTA CCAAGATTAC 
TCACAAGAAC AGCTTTTAAA GCTTCTGGGA AACTTCTGAG GCTCTCTTCA 
GTAATTTCTA GG AC C TTTTC TCAAACTACT ACTTCCTATG CAGCTGCTTT 
TGATCGTTCC AAACCGCATG TAAATATAGG TACGATCGGC CATGTTGATC 

10 ATGGGAAGAC AACTTTAACC GCAGCCATTA CGAAAACGTT AGCCGCAAAA 
GGTGGTGCCA ACTTCTTGGA CTATGCTGCC ATCGATAAGG CTCCGGAAGA 
AAGAGCTCGT GGTATTACAA TTTCTACTGC ACACGTGGAA TACGAAACGG 
CCAAGAGACA TTATTCTCAC GTCGACTGTC CAGGCCACGC TGATTACATC 
AAGAATATGA TTACCGGTGC TGCTCAAATG GATGGTGCTA TCATTGTTGT 

15 AGCTGCTACC GATGGACAAA TGCCCCAAAC TAGAGAACAT TTAC TTTTGG 
CCAGACAAGT TGGTGTCCAA CATATTGTCG TTTTTGTTAA CAAGGTTGAT 
ACCATTGATG ATCCAGAAAT GTTAGAGTTA GTCGAAATGG AAATGAGAGA 
ACTTTTAAAC GAATATGGGT TTGAC GGTG A TAATGCTCCA ATTATCATGG 
GTTCTGCCCT TTGCGCTTTG GAAGGTCGCC AACCTGAAAT TGGGGAGCAG 

2 0 GCCATCATGA AAC TTTTGG A TGCAGTGGAT GAGTATATTC CTACACCTGA 
AAGAGATTTG AACAAGCCTT TCTTGATGCC CGTTGAAGAT ATCTTCTCTA 
TCTCCGGTAG AGGTAC TGTG GTCACTGGTC GTGTGGAAAG GGGTAATTTA 
AAGAAAGGTG AGGAATTGGA AATTGTTGGT CACAACTCCA CCCCATTGAA 
AACAACAGTT AC TGGTATTG AAATGTTTAG AAAGGAATTG GACTCTGCTA 

2 5 TGGCAGGTGA CAATGCCGGT GTTTTACTTA GAGGTATCAG GAGAGATCAA 

TTGAAGAGAG GTATGGTCTT AGCTAAGCCA GGTACCGTTA AAGCC C AT AC 
AAAGATTCTA GCCTCTTTGT ACATTTTATC CAAAGAGGAA GGTGGTAGAC 
ATTCTGGGTT TGGTGAAAAC TACAGACCAC AAATGTTTAT AAGAACAGCT 
GATGTTACAG TTGTGATGAG ATTTCCTAAG GAGGTTGAAG ATCATTCTAT 

3 0 GCAAGTTATG CCAGGTGACA ATGTTGAAAT GGAATGTGAT TTGATCCATC 

CTACCCCATT AGAAGTTGGT CAACGTTTCA ATATCAGAGA GGGTGGAAGA 
ACTGTTGGTA CCGGTCTAAT C AC AC GT ATT ATTGAATAGA CTTATTGATG 
CAACTGGAGT ATATTTCTAT ATATTC TGTT CATTTCCCCT CTCATAATAT 
. ATAC TTGTTT CGTTAAAATT TTATACGTGT AAATAAAGTG CCATAAATTT 
3 5 TTC AG CTTTA CTTTTGGTAG AGTC CTGCT A GC AC TAG ATT TTACAATTTC 
ATGTGCACAC ACC 



40 2 ) INFORMATION FOR SEQ ID NO: 620 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1300 
1350 
1400 
1450 
1500 
1550 
1600 
1650 
1700 
1713 



45 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 620 



ATTGGTGCAT TGCTACGT 



18 



55 



2 ) INFORMATION FOR SEQ ID NO: 621 



60 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 751 bases 
<B) TYPE: Nucleic acid 



563 
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(C) STRAND EDNESS : Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: ATCC 19434 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 621 

TGGTGCAATC TTAGTTGTTT CTGCAACTGA CGGTCCGATG CCTCAAACAC 50 

GTGAACACAT TTTATTGTCA CGCCAAGTTG GTGTAAAATA CCTGATTGTT 100 

TTCTTGAACA AAGTTGATTT AGTCGATGAT GAAGAATTGA TCGATTTGGT 150 

15 AGAAATGGAA GTTCGCGAGT TATTGAGCGA ATATGGTTTC CCAGGCGATG 2 00 

ACACTCCTGT GATCAAAGGT TCCGCATTAA AAGCATTGCA AGGCGATCCA 2 50 

GATGCTGAAG CTGCTATTAT GGAATTGATG GATACAGTAG ATGAATATAT 3 00 

CCCAACACCA GAACGTGATA CAGATAAACC ATT AC T ATTG CCAGTGGAAG 3 50 

ACGTCTTCTC AATTACAGGT CGAGGAACTG TTGCCTCAGG TCGTATTGAT 400 

20 CGTGGTGCTG TTCGTGTCGG TGATGAGGTA GAGATCGTAG GGATCAAACC 450 

TGAAACACAA AAAGCAGTTG TAACAGGTGT AG AAATG TTC CGTAAAACGT 500 

TAGATTACGG GG AAGCTGGG GATAACGTAG GCGTGTTGTT ACGGGGGATC 550 

CAACGTGACG ATATCGAACG TGGACAAGTA CTTGCTAAAC CAGGTTCCAT 600 

TACTCCACAT ACAAAATTCA AAGCAGAAGT GTAC GTGTTG ACAAAAGAAG 650 

2 5 AAGGTGGACG TCATACTCCA TTCTTCAACA ACTATCGTCC ACAGTTC T AC 700 

TTCCGCACAA CTGATGTTAC AGGAACAATC ACATTGCCAG AAGATACAGA 750 



30 



40 



-2 ) INFORMATION FOR SEQ ID NO:- 622 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 750 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(B) STRAIN: ATCC 13264 



751 



45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 622 

GTCAAATGGG ACGAATCCAG ATTCCAAGAA ATTGTCAAGG AAACCTCCAA 50 

CTTTATCAAG AAGGTTGGTT ACAACCCAAA GACTGTTCCA TTCGTCCCAA 100 

TCTCTGGTTG GAACGGTGAC AACATGATTG AAGCTACCAC CAACGCTCCA 150 

50 TGGTACAAGG GTTGGGAAAA GGAAACCAAG GCCGGTGTCG TCAAGGGTAA 200 

GACTTTGTTG GAAGC C ATTG ACGCCATTGA ACAACCATCT AGACCAACTG 250 

AC AAGC C ATT G AGATTGC C A TTGCAAGATG TTTACAAGAT TGGTGGTATT 3 00 

GGTACTGTGC CAGTCGGTAG AGTTGAAACC GGTGTCATCA AGCCAGGTAT 350 

GGTTGTTACT TTCGCCCCAG CTGGTGTTAC CACTGAAGTC AAGTC CGTTG 400 

55 AAATGCATCA CGAACAATTG GAACAAGGTG TTCCAGGTGA CAACGTTGGT 450 

TTCAACGTCA AGAACGTTTC CGTTAAGGAA ATCAGAAGAG GTAACGTCTG 500 

TGGTGACGCT AAGAACGATC CACCAAAGGG TTGCGCTTCT TTCAACGCTA 550 

CCGTCATTGT TTTGAACCAT CCAGGTCAAA TCTCTGCTGG TTACTCTCCA 600 

GTTTTGGATT GTCACACTGC TCACATTGCT TGTAGATTCG AC G AATTGTT 650 

60 GGAAAAGAAC GACAGAAGAT CTGGTAAGAA GTTGGAAGAC CATCCAAAGT 7 00 

564 
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TCTTGAAGTC CGGTGACGCT GCTTTGGTCA AGTTCGTTCC ATCTAAGC C A 7 50 



5 2 ) INFORMATION FOR SEQ ID NO: 623 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 69 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 <vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus neoformans 

(B) STRAIN: ATCC 44104 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 623 



TCTTGAAAGC TTAAGGCCGA GCGAGAGCGA GGTATC AC C A TCGACATTGC 50 
TCTTTGGAAG TTCGAGACCC CCAGGTACCA GGTCACCGTC ATTGACGCCC 100 
CCGGTCACCG AGACTTCATC AAGAACATGA TCACCGGTAC CTCCCAGGCT 150 
GACTGTGCCA TCCTCATCAT TGCCACCGGT ATCGGTGAGT TCGAGGCCGG 2 00 

25 TATCTCCAAG GAC GGTC AG A CCCGAGAGCA CGCCCTCCTC GCCTTCACCC 2 50 

TCGGTGTCAG GCAGCTCATT GTTGCTTGCA ACAAGATGGA CACCTGCAAG 3 00 

TGGTCCGAGG ACCGATTCAA CGAAATCGTC AAGG AG AC C A ACGGTTTCAT 350 
CAAGAAGGTT GGTTACAACC CCAAGGCTGT CCCCTTCGTC CCCATCTCTG 4 00 

GTTGGCACGG TGACAACATG TTGGAGGAGA CCACCAAGTC AGTGAATCCG 4 50 

30 CTTCTACGTG ATGAGATGTT TTTCTGACTT TCCGTGCAGC ATGCCCTGGT 5 00 

ACAAGGGATG G AC C AAGGAG ACCAAGTCTG GTGTTTCCAA GGGTAAGACC 550 
CTCCTCGAGG CCATCGACGC CATCGAGCCC CCTACCCGAC CCACCGACAA 600 
GCCCCTCCGT CTCCCTCTCC AGG AC GTCTA CAAGATCGGT GGTATCGGCA 650 
CAGTCCCTGT CGGCCGAGTC GAGACCGGTG TCATCAAGGC CGGTATGTTG 7 00 

3 5 TCTCATCTCT CTTGTCTCGT AACATGCGTC TCGTAACATG CGCTTACTTC 7 50 

ATTTTCAGGT ATGGTCGTCA AGTTCGCCCC CACCAACGTC ACCACTGAAG 8 00 

TCAAGTCCGT TGAGATGCAC CACGAGCAGA TCCCCGAGGG TCTTCCCGGA 8 50 

GACAACGTTG GTTTCAACGT CAAGAACGTT TCCATCAAGG ACATCCGACG 900 
AGGTAACGTC TGTGGTGACT CCAAGAACGA CCCCCCTATG GAGGCTGCTT 9 50 

40 CTTTCAACGC CCAGGTTATC GTCCTTAACC ACCCTGGTCA GATCGGTGCC 1000 

GGTTACACCC CCGTTCTCGA CTGTCACACT GCCCGTAAGC CTGACCCAAT 1050 

ACCTCCAACA TACCTTTGAA GCTGACCCTT TCTAGACATT GCCTGCAAGT 1100 

TTGCTGAGTT GATCGAGAAG ATTGACCGAC GAACCGGTAA GGTCATGGAG 1150 

GCCGCCCCCA AGTTCGTCAA GTC TGGTG AC GCCGCCATTG TCAAGCTTGT 12 00 

45 TGCCCAGAAG CCCCTCTGTG TTGAGACCTA CGCCGACTAC CCCCCTCTTG 12 5 0 

GTCGATTCGC CGTCCGAGA 12 69 



50 2) INFORMATION FOR SEQ ID NO: 624 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 753 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



60 (vi) ORIGINAL SOURCE: 

565 
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(A) ORGANISM: Candida albicans 

(B) STRAIN : ATCC 36801 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 624 



TCTGTCAAAT GGGACAAAAA CAGATTTGAA GAAATCATCA AGGAAACCTC 50 

CAACTTCGTC AAGAAGGTTG GTTACAACCC AAAGACTGTT CCATTCGTTC 100 

CAATCTCTGG TTGGAATGGT GACAACATGA TTG AAC CATC CACCAACTGT 15 0 

CCATGGTACA AGGGTTGGGA AAAGGAAACC AAATCCGGTA AAGTTACTGG 2 00 

10 TAAGACCTTG TTAGAAGCTA TTGACGCTAT TGAACCACCA ACC AG AC C AA 250 

CCGACAAACC ATTGAGATTG CCATTGCAAG ATGTTTACAA GATCGGTGGT 3 00 

ATTGGTACTG TGCCAGTCGG TAGAGTTGAA ACTGGTATCA TCAAAGCCGG 350 

TATGGTTGTT ACTTTCGCCC CAGCTGGTGT TAC C AC TG AA GTCAAATCCG 400 

TTGAAATGCA TCACGAACAA TTGGCTGAAG GTGTTCCAGG TGACAATGTT 450 

15 GGTTTCAACG TTAAGAACGT TTCCGTTAAA GAAATTAGAA GAGGTAACGT 500 

TTGTGGTGAC TCCAAGAACG ATCCACCAAA GGGTTG TG AC TCTTTC AATG 550 

CCCAAGTCAT TGTTTTGAAC CATCCAGGTC AAATCTCTGC TGGTTACTCT 600 

CCAGTCTTGG ATTGTCACCC TGCCCACATT GCTTGTAAAT TCG AC AC TTT 650 

GGTTGAAAAG ATTGACAGAA GAACTGGTAA GAAATTGGAA GAAAATCCAA 7 00 

2 0 AATTCGTCAA ATCCGGTGAT GCTGCTATCG TCAAGATGGT CCCAACCAAA 750 

CCA 753 



2 5 2 ) INFORMATION FOR SEQ ID NO: 625 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Single 

(-D) —TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: DNA 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 625 

40 CGTTGAAGAC ACGACCCAAA GTATCC 26 



2 ) INFORMATION FOR SEQ ID NO: 626 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Streptococcus agalactiae 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 626 

TACCACCTTT TAAGTAAGGT GCTAAT 26 

60 
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10 



15 



20 



40 



50 



55 



2 ) INFORMATION FOR SEQ ID NO: 627 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 627 

ATTGTC TATA AAAATGGCGA TAAGTC 2 6 



2 ) INFORMATION FOR SEQ ID NO : 628 



(i) SEQUENCE CHARACTERISTICS: 

2 5 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 628 

AAAATGGCGA TAAGTCACAA AAAGTA 2 6 



2 ) INFORMATION FOR SEQ ID NO: 629 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia pneumoniae 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 629 

AAGTTCCATC TCAACAAGGT CAATA 2 5 



60 2 ) INFORMATION FOR SEQ ID NO: 630 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia pneumoniae 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 630 
CGGAGCTATC CTAGTCGTTT CA 22 



2 ) INFORMATION FOR SEQ ID NO: 631 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

MOLECULE TYPE: DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 631 
CAGACC AACY * ' GAI AARCC AT TRAGAT 



35 2 ) INFORMATION FOR SEQ ID NO: 632 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 632 

CCCTTTGGTG GRTC STKCTT GGA 



(i) 



25 



(ii) 
(xi) 



2 ) INFORMATION FOR SEQ ID NO : 633 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

60 

568 



CA 02307010 2000-05-19 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 633 
CAGACCAACY GAIAARCCIT TRAGAT 2 6 

5 

2 ) INFORMATION FOR SEQ ID NO: 634 

(i) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 4 

AACACYGTCA GRRCIATTGC YATGGA 9fi 
20 Zt 

2 ) INFORMATION FOR SEQ ID NO: 635 

25 (i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 635 
35 AAACCRGTIA RRGCRACTCT IGCTCT 26 



40 



50 



2) INFORMATION FOR SEQ ID NO : 63 6 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 63 6 
ACTGGYGTTG AIATGTTCCG YAA 23 



55 2 ) INFORMATION FOR SEQ ID NO: 637 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 637 

ACGTCAGTIG TACGGAARTA GAA 2 3 



10 



20 



25 



2 ) INFORMATION FOR SEQ ID NO: 638 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 6 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 638 
CCAATGCCAC AAAC I CGTG A RCACAT 2 6 

2 ) INFORMATION FOR SEQ ID NO : 639 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 28 bases 

(B) TYPE: Nucleic acid 

(C ) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

3 5 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 639 
TTTACGGAAC ATTTCWACAC CWGTIACA 28 

40 

2 ) INFORMATION FOR SEQ ID NO: 640 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 640 
55 . TCCATGGTIT WYGG I CARAT GAA 23 



2) INFORMATION FOR SEQ ID NO: 641 

60 
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10 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 641 
TGATAACCWA CIGCIGAIGG CATACG 2 6 



15 2 ) INFORMATION FOR SEQ ID NO: 642 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

2 0 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 642 

GGCGTIGGIG ARCGIACICG TGA 23 



30 



40 



45 



2TINFORMATrON~FOR~~SEQ ID NO : 643 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
35 (B) TYPE: Nucleic acid 

{C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 643 
ACTGGIGTIG ARATGTTCCG YAA 23 

2 ) INFORMATION FOR SEQ ID NO: 644 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 644 

ACGTCIGTIG TICKGAARTA GAA 
60 Z6 
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10 



20 



2) INFORMATION FOR SEQ ID NO: 645 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 645 
15 ACGTCIGTIG TICKGAARTA RAA 23 



2 } INFORMATION FOR SEQ ID NO: 64 6 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 646 

30 

ATCGACAAGC CITTCYTIAT GSC 23 



35 2 ) INFORMATION FOR SEQ ID NO: 647 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 647 

ACGTCCGTSG TRCGGAAGTA GAACTG 2 6 



50 

2) INFORMATION FOR SEQ ID NO: 648 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



60 



(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 8 
ACGTCSGTSG TRCGGAAGTA GAACTG 2 6 

5 

2 ) INFORMATION FOR SEQ ID NO: 649 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 649 
GTCCTATGCC TCARACWCGI GAGCAC 2 6 

20 

2 ) INFORMATION FOR SEQ ID NO: 650 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 650 
35 TTACGGAACA TYTCAACACC IGT 23 



40 



50 



2 ) INFORMATION FOR SEQ ID NO: 651 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 651 
TGACGACCAC C I TCYTCYTT YTTCA 25 



55 2 ) INFORMATION FOR SEQ ID NO : 652 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 652 

CCWAYAGTIY KICCICCYTC YCTIATA 27 



10 



20 



25 



2) INFORMATION FOR SEQ ID NO: 653 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 653 
GAYTTCATIA ARAAYATGAT 2 0 

2 ) INFORMATION FOR SEQ ID NO: 654 



(i) SEQUENCE CHARACTERISTICS : 
30 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 654 
TACAARATYK GIGGTATYGG 2 0 

40 

2 ) INFORMATION FOR SEQ ID NO : 655 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 655 
55 CCRATACCIC MRATYTTGTA 20 



2) INFORMATION FOR SEQ ID NO: 656 

60 
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10 



30 



40 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65 6 
AATTAATGGC TGCAGTTGAY GA 22 



15 2 ) INFORMATION FOR SEQ ID NO : 657 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

<D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 657 

TTGTCCACGT TCGATRTCTT CA 22 



2 ) INFORMATION FOR. SEQ ID NO: 658 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
35 (B) TYPE: Nucleic acid • 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 658 
GATYTAGTCG ATGATGAAGA ATT 23 

2 ) INFORMATION FOR SEQ ID NO: 659 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 659 

GCTTTTTGIG TTTCWGGTTT RAT 9; 
60 Z6 
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10 



20 



2 ) INFORMATION FOR SEQ ID NO: 660 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 660 
15 GTAGAATTGA GGACGGTAGT TAG 23 



2 ) INFORMATION FOR SEQ ID NO: 661 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 661 

30 

GTAGAAYTGT GGWCGATART TRT 2 3 



35 2 ) INFORMATION FOR SEQ ID NO: 662 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebac terium dlphtherlae 

(B) STRAIN: ATCC 27010 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 662 



CGGCGCAATC CTCGTTGTTG CTGCCACCGA CGGCCCAATG CCTCAGACCC 50 

GTGAGCACGT TCTGCTCGCT CGCCAGGTCG GCGTTCCTTA CATCCTCGTT 100 

GCTCTGAACA AGTGCGACAT GGTTGATGAT GAGGAAATCA TCGAGCTCGT 150 

C G AGATGG AG ATCCRTGAGC TGCTCGCTGA GCAGGATTAC GACGAAGAGG 200 

55 CTCCAATCAT CCACATCTCC GCACTGAAGG CTCTTGAGGG CGACGAGAAG 250 

TGGACCCAGT CCATCATCGA CCTCATGCAG GCTTGCKATG ATTCCATCCC 3 00 

AGACCCAGAG CGTGAGACCG ACAAGCCATT CCTCATGCCT ATCGAGGACA 350 

TCTTCACCAT CACCGGCCGC GGTACCGTTG TTACCGGCCG TGTTGAGCGT 400 

GGCTCCCTGA AGGTC AAC G A GGACGTCGAG ATCATCGGTA TCCGCGAGAA 450 

60 KGCTACCACC ACCACCGTTA CCGGTATCGA GATGTTCCGT AAGCTTCTCG 5 00 
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ACTACACCGA 
CGCGAAGACG 
CCCTCACACC 
GTGGCCGCCA 
CGCACCACCG 
GGTCATGCCT 
TCGCTATGGA 



GGCTGGCGAC 
TTGAGCGTGG 
GAGTTCGAGG 
CACCCCATTC 
ACGTTACCGG 
GGCGACAACG 
TGAGGGCCTG 



AACTGTGGTC 
CCAGGTTGTT 
GCTCTGTCTA 
TTCGACAACT 
TGTTGTGAAG 
TCGACATGTC 
CGCTTCGCTA 



TGCTTCTCCG 
GTTAAGCCAG 
CGTTCTGTCC 
ACCGCCCACA 
CTTCCTGAGG 
CGTCACCCTG 
TC 



TGGCGTTAAG 
GCGCTTACAC 
AAGGACGAGG 
GTTCTACTTC 
GCACCGAGAT 
ATCCAGCCTG 



550 
600 
650 
700 
750 
800 
832 



10 



15 



20 



25 



30 



35 



40 



45 



50 



2 ) INFORMATION FOR SEQ ID NO: 663 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1192 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida catenulata 

(B) STRAIN: ATCC 10565 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 663 



AACGGCGAGC 
CACCGTGCGT 
CCGCTGTCAC 
ACCTTGGGCC 
TCCCATCCAG 
TCACCGAGCA 
GTCGACTTGT 
TGGTGCCGGT 
TTGCCAAGGC 
ACTCGTGAGG 
CAACTTGGAG 
AGCCCCCGGG 
GAGTACTTCC 
CATTTTCAGA 
GTATCCCCTC 
TTGTTGCAGG 
GCAGGCCGTG 
CCACTTTCGC 
GAGTTGGGTA 
GTTGGACGTC 
TCCAGGAGTG 
TTGGGTATGG 
CCGTAAGATC 
TC AC TGGTAT 



AAGAC TTGGT 

ACCATTGCCA 

TGACACCGGT 

GGATCATCAA 

TGCAAGCAGA 

GTCCGTCGAG 

TGGCTCCCTA 

GTCGGTAAGA 

CCACGGTGGT 

GTAACGACTT 

GGCGACTCCA 

GGCTCGTGCC 

GTGACGAGGA 

TTCACCCAGG 

CGCCGTCGGT 

AGAGAATTAC 

TACGTCCCTG 

TCACTTGGAC 

TCTACCCCGC 

GAGGTTGTTG 

CTTGCAGGCC 

ACGAGTTGTC 

CAGCGTTTCT 

CCCCGGTAGA 



GTTGGAGGTT 

TGGACGGTAC 

GCTCCCATCT 

CGTTGTCGGT 

GAAACCCCAT 

GCTGAGGTGT 

CGCCCGTGGT 

CCGTGTTCAT 

TTCTCCGTGT 

GTACCGTGAA 

AGGTGGCCTT 

CGTGTCGCCT 

AGGCCAGGAC 

CCGGTTCCGA 

TACCAGCCCA 

CACCACCAAG 

CCGATGACTT 

GCCACCACCG 

CGTCGACCCC 

GCCAGGAGCA 

TACAAGTCGT 

CGAGCAGGAC 

TGTCGCAGCC 

TTGGTGAGAT 



TCTCAGCACT 
CGAGGGTTTG 
CGGTCCCCGT 
GAGCCCATCG 
TCACGCCGAG 
TGGAGACCGG 
GGTAAGATTG 
CCAGGAGTTG 
TCACTGGTGT 
ATGAAGGAGA 
GGTGTTCGGT 
TGACCGGTTT 
GTGTTGTTGT 
GGTGTCGGCG 
CTTTGGCCAC 
AAGGGTTCCG 
GACTGACCCT 
TGTTGTCGCG 
TTGGAC TC C A 
CTACGACGTC 
TGCAGGACAT 
AAGTTGACCG 
CTTCGCTGTC 
TGCAGGACAC 



TGGGTGAGAA 50 

GTGAGAGGTA 100 

TGGTCAGGGT 150 

ACGAGCGTGG 2 00 

CCCCCGTCTT 2 50 

TATCAAGGTT 3 00 

GTCTTTTCGG 3 50 

ATTAACAACA 4 00 

CGGTGAGCGT 450 

CCGGTGTCAT 5 00 

CAGATGAACG 550 

GACCATTGCC 600 

TCGTTGACAA 650 

TTGTTGGGTC 700 

CGACATGGGT 7 50 

TCACCTCTGT 800 

GCCCCCGCCA 850 

TGGTATCTCC 9 00 

AGTCGAGATT 950 

GCCACCGGTG 1000 

CATTGCCATT 1050 

TCGAGAGAGC 1100 

GCCGAGGTTT 1150 

CG 1192 



55 



60 



2) INFORMATION FOR SEQ ID NO: 664 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 664 

5 

AAYATGAT I A CIGGIGCIGC ICARATGGA 29 



10 2 ) INFORMATION FOR SEQ ID NO : 665 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1377 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(C) ACCESSION NUMBER: extracted from X00779 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 665 

25 

ATGGGTAAAG AGAAGTC TC A CATTAACGTT GTCGTTATCG GTCATGTCGA 50 

TTCTGGTAAG TC T AC C AC T A CCGGTCATTT GATTTACAAG TGTGGTGGTA 100 

TTGACAAGAG AACCATCGAA AAGTTCGAAA AGGAAGCCGC TGAATTAGGT 150 

AAGGGTTCTT TCAAGTACGC TTGGGTTTTG GACAAGTTAA AGGCTGAAAG 2 00 

3 0 AGAAAGAGGT ATCACTATCG ATATTGCTTT GTGGAAGTTC GAAACTCCAA 2 50 

AGTAGGAAGT TACCGTTATT GATGCTCCAG GTCACAGAGA TTTCATCAAG 3 00 

AACATGATTA "CTGGTACTTC TCAAGCTGAC TGTGCTATCT TGATTATTGC 350 

TGGTGGTGTC GGTGAATTCG AAGCCGGTAT CTCTAAGGAT GGTCAAACCA 400 

GAGAACACGC TTTGTTGGCT TTCACCTTGG GTGTTAGACA ATTGATTGTT 450 

35 GCTGTCAACA AGATGGACTC CGTCAAATGG GACGAATCCA GATTCCAAGA 500 

AATTGTCAAG GAAACCTC CA ACTTTATCAA GAAGGTTGGT TACAACCCAA 550 

AGACTGTTCC ATTCGTCCCA ATCTC TGGTT GGAACGGTGA CAACATGATT 600 

GAAGCTACCA CCAACGCTCC ATGGTACAAG GGTTGGGAAA AGGAAACCAA 650 

GGCCGGTGTC GTCAAGGGTA AGACTTTGTT GGAAGC C ATT GACGCCATTG 700 

40 AACAACCATC TAGACCAACT GACAAGCCAT TGAGATTGCC ATTGCAAGAT 7 50 

GTTTACAAGA TTGGTGGTAT TGGTACTGTG CCAGTCGGTA GAGTTGAAAC 800 

CGGTGTCATC AAGCCAGGTA TGGTTGTTAC TTTTGCCCCA GCTGGTGTTA 850 

CCACTGAAGT CAAGTCCGTT GAAATGCATC ACGAACAATT GGAACAAGGT 9 00 

GTTCCAGGTG ACAACGTTGG TTTCAACGTC AAGAACGTTT CCGTTAAGGA 950 

45 AATCAGAAGA GGT AAC GTC T GTGGTGACGC TAAGAACGAT CCACCAAAGG 1000 

GTTGCGCTTC TTTCAACGCT AC CGTC ATTG TTTTGAACCA TCCAGGTCAA 1050 

ATCTC TGCTG GTTACTCTCC AGTTTTGGAT TGTCACACTG CTCACATTGC 1100 

TTGTAGATTC GACGAATTGT TGGAAAAGAA CGACAGAAGA TCTGGTAAGA 1150 

AGTTGGAAGA CCATCCAAAG TTCTTGAAGT CCGGTGACGC TGCTTTGGTC 1200 

50 AAGTTCGTTC CATCTAAGCC AATGTGTGTT GAAGCTTTCA GTGAATACCC 1250 

AC C ATT AGGT AGATTCGCTG TCAGAGACAT GAGACAAACT GTCGCTGTCG 13 00 

GTGTTATCAA GTCTGTTGAC AAGACTGAAA AGGCCGCTAA GGTTACCAAG 1350 

GCTGCTCAAA AGGCTGCTAA GAAATAA 1377 

55 

2 ) INFORMATION FOR SEQ ID NO: 666 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 1536 bases 

578 
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10 



45 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(C) ACCESSION NUMBER: extracted from M12082 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 666 



ATGGTTTTGC CAAGACTATA TACTGCTACA TCCCGTGCTG CTTTTAAAGC 50 

AGCCAAACAA TCCGCTCCGC TTCTATCCAC TTCGTGGAAA AGATGTATGG 100 

15 CCTCAGCTGC TCAATCTACT CCAATCACCG GTAAAGTTAC CGCTGTCATT 150 

GGTGCCATTG TTGACGTTCA TTTTGAACAA TCAGAGTTGC CCGCTATTTT 2 00 

GAACGCTTTA GAAATTAAAA CACCTCAAGG TAAGTTGGTT TTGGAAGTTG 250 

CTCAACATTT GGGTGAAAAC ACTGTCAGAA CCATTGCTAT GGATGGTACC 300 

GAAGGTTTGG TCCGTGGTGA AAAGGTTCTT GACACTGGTG GCCCTATCTC 350 

2 0 CGTCCCAGTT GGGAGAGAAA CTTTAGGGAG AATCATCAAC GTTATCGGTG 400 

AACCTATTGA TGAAAGAGGT CCAATTAAGT CCAAACTAAG AAAGCCAATT 4 50 

CACGCAGACC CTCCTAGTTT TGCAGAACAA TCTACTTCGG CTGAAATTTT 500 

GGAAACAGGT ATCAAAGTCG TCGATCTATT AGCTCCTTAT GCCAGAGGTG 5 50 

GTAAGATTGG TCTTTTCGGT GGTGCAGGTG TCGGTAAGAC TGTGTTCATT 600 

25 CAAGAATTGA TTAACAATAT CGCCAAGGCC CATGGTGGTT TTTCCGTTTT 650 

CGCCGGTGTT GGTGAAAGGA CCAGAGAGGG TAATGACTTG T AC CGTG AAA 7 00 

TGAAGGAAAC TGGAGTCATT AACTTGGAAG GTGAATCCAA GGTCGCCTTA 750 

GTTTTCGGTC AAATGAACGA ACCTCCAGGA GCCAGAGCCA GAGTCGCTTT 800 

AAC TGGTTTG ACGATCGCTG AATATTTCAG AGATGAAGAA GGTCAAGACG 850 

30 TCTTGTTGTT TATCGACAAT ATCTTTAGAT TTACTCAAGC TGGTTCAGAA 900 

GTCTG-TGGGG— TTTTGGGTCG TATTCCATCT GCCGTCGGTT ATCAACCAAC 950 

TTTGGCCACT GATATGGGTC TCTTACAAGA AAGAATTACC ACCACAAAGA 1000 

AGGGTTCTGT CAC TTCTGTG CAAGCCGTTT ATGTTCCAGC CGATGATTTA 1050 

ACAGATCCGT CTCCGTCCAC ATCTTTTGCC CATTTGGACG CATCATCCGT 1100 

35 CTTGTCAAGA GGTATTTCAG AATTAGGTAT TTACCCTGCA GTGGATCCAT 1150 

TGGATTCTAA ATCAAGGTTA TTGGATGCCG CCGTTGTCGG TCAAGAACAT 12 00 

TATGACGTCG CCTCCAAGGT TCAAGAAACT TTACAGACCT ATAAATCTTT 1250 

ACAAGATATC ATTGCTATTT TGGGTATGGA TGAATTGTCC GAACAAGATA 13 00 

AACTAAC TGT CGAAAGGGCA AGAAAGATTC AAAGATTCTT ATCTCAACCA 1350 

40 TTTGCTGTCG CCGAAGTCTT TACTGGTATC CCAGGTAAAT TAG TGAG ATT 14 00 

AAAGGACACC GTTGCCTCGT TCAAAGCCGT TTTGGAAGGT AAATACGATA 145 0 

ATATACCAGA ACATGCTTTC TATATGGTTG GTGGTATTGA AGATGTTGTT 15 0 0 

CGTAAAGCTG AAAAGTTAGC CCGTGAAGCC AACTAG 153 6 



2 ) INFORMATION FOR SEQ ID NO: 667 



{ i ) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 1293 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: Y 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 667 

CGGAAGGCGT GCCGCCGGTG TTAACTGCTT TGGATGTGGT GGAAAAACTT 50 

GGCCGTGACG AGCCCTTGAC ACTTGAAATT GTACAGCACT TGGACGCCCA 100 

5 CACCGGCCGT TGCATTGCGA TGCAGACGAC AGATCTGCTT AAGCTGAAAT 150 

Saaggtagt CTCGAGTGGC GGGAACATTT CTGTGCCTGT GGGTCGGGAG 200 

SScStSS GAATCTTCAA TGTGCTGGGC GATGCCATTG ACCAGCGCGG 250 

CCATGTTGGA GAGAAGCAAC GCATGCCTAT TCACGCTGTG GCACCAAAGT 300 

TGGCGGATCA GGCGGCAGAG GACACCATTC TCACAACGGG TATCAAGGTG 350 

10 ATTGATCTGA TTTTGCCCTA CTGCAAGGGT GGGAAGATTG GTCTTTTTGG 400 

TGGTGC TGGC GTGGGCAAGA C GGTTATT AT CATGGAACTT ATTAACAACG 450 

TTGC C AAGGG CCATGGTGGG TTCTCCGTCT TTGCGGGTGT TGGTGAACGC 500 

ACGCGTGAGG GGACGGATCT TTACCTGGAG ATGATGCAAT CCAAAGTTAT 550 

TGACCTGAAG GGTGACTCGA AGTGTGTGTT GGTGTATGGT CAGATGAACG 600 

15 AGCCTCCGGG TGCCCGTGCT CGTGTGGCGC AGTCTGCCTT GACAATGGCC 650 

GAGTACTTCC GTGATGTGGA AGGGCAAGAC GTGTTGCTTT TTATCGACAA 700 

CATTTTCCGC TTTACGCAGG CAAACTCTGA GGTGTCAGCG CTGTTGGGTC 750 

GTATTCCCGC TGCCGTCGGC TACCAGCCTA CCCTTGCTGA GGATCTTGGG 800 

CAGTTGCAGG AGCGCATTAC GTCCACGACA AAAGGTTCCA TTACCTCTGT 85 0 

20 GCAGGCTGTG TACGTGCCAG CCGATGATAT TACCGACCCT GCGCCAGCAA 900 

CGACCTTTTC CCACCTCGAT GCCACGACGG TGCTGGACCG TGCCGTTGCC 950 

GAATCCGGCA TTTACCCCGC TGTCAACC C A CTGGAGTGTG CCTCGCGTAT 1000 

CATGGACCCG GATGTAATCA GCGTTGACCA CTACAACGTG GCGCAGGATG 1050 

TGGTGC AGAT GCTTACCAAG TACAAGGAGC TGCAGGATAT CATTGCGGTG 1100 

25 CTTGGCATTG ATGAGCTCAG TGAGGAGGAT AAACTTATCG TGGACCGTGC 1150 

GCGTAAGGTG ACAAAGTTTC TCTCCCAGCC TTTC C AGGTG GCGGAGGTGT 1200 

TTACTGGCAT GACAGGCCAC TACGTGCAGC TGGAGGAGAC CATTGAGTCT 125 0 

TTTTCCGGCC TGTTGATGGG CACATATGAT CAGGTGCCGG AGA 1^^^ 

30 

2 ) INFORMATION FOR SEQ ID NO : 668 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 1191 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: CaryneJbacterium glutamxcum 
(C) ACCESSION NUMBER: X77034 



45 



50 



55 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 668 



ID NO: 668 






CCCCACGTAA 


ACATCGGCAC 


50 


CACCACCGCG 


GCTATCACCA 


100 


ACGAGGCTTT 


CGCCTTCGAC 


150 


CGTGGCATCA 


CGATCAACAT 


200 


CCACTACGCA 


CACGTTGACG 


250 


TGATTACCGG 


CGCTGCTCAG 


300 


ACCGACGGCC 


CAATGCCTCA 


350 


GGTTGGCGTT 


CCTTACATCC 


400 


AGGATGAGGA 


AATCATCGAG 


450 


GCTGAGCAGG 


ACTACGACGA 


500 


GAAGGCTCTT 


GAGGGCGACG 


550 


TGCAGGCTTG 


CGATGACAAC 


600 


CCATTCCTCA 


TGCCTATCGA 


650 
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GGACATCTTC ACCATCACCG GTCGTGGCAC CGTTGTTACC GGTCGTGTTG 7 00 

AGCGCGGTAC CCTGAACGTG AACGATGATG TTGACATCAT CGGCATCAAG 7 50 

GAGAAGTCCA CCTCCACCAC CGTTACCGGT ATCGAGATGT TCCGTAAGCT 800 

TCTTGACTCC GCTGAGGCTG GCGACAACTG TGGTCTGCTT CTCCGTGGTA 85 0 

5 TCAAGCGCGA AGATGTTGAG CGTGGCCAGG TTATCGTTAA GCCAGGCGCT 900 

TACACCCCTC ACACCGAGTT CGAGGGCTCT GTCTACGTTC TGTCCAAGGA 950 

TGAAGGTGGC CGCCACACCC CATTCTTCGA CAACTACCGT CCTCAGTTCT 1000 

ACTTCCGCAC CACCGACGTT ACCGGTGTTG TGAAGCTTCC AGAGGGCACC 1050 

GAGATGGTCA TGCCTGGCGA CAACGTCGAC ATGTC CGTC A CCCTGATCCA 1100 

10 GCCTGTCGCT ATGGACGAGG GCCTGCGTTT CGCTATCCGC GAAGGCTCCC 115 0 

GCACCGTTGG CGCTGGTCGT GTCACCAAGA TCATCAAGTA A 1191 



15 2 ) INFORMATION FOR SEQ ID NO: 669 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 bases 

(B) TYPE: Nucleic acid 

2 0 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: extracted from V00267 



30 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 669 



ATGGCTACTG GAAAGATTGT CCAGGTAATC GGCGCCGTAG TTGACGTCGA 5 0 

ATTCCCTCAG GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTGCAAA 100 

ATGGTAATGA GCGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGCGGT 150 

ATCGTACGTA CCATCGCAAT GGGTTCCTCC GACGGTCTGC GTCGCGGTCT 2 00 

35 GGATGTAAAA GACCTCGAAC ACCCGATTGA AGTCCCGGTA GGTAAAGCGA 25 0 

CTCTGGGCCG TATCATGAAC GTACTGGGTG AACCGGTCGA CATGAAAGGC 30 0 

GAGATCGGTG AAGAAGAGCG TTGGGCGATT CACCGCGCAG CACCTTCCTA 35 0 

CGAAGAGCTG TCAAACTCTC AGGAACTGCT GGAAACCGGT ATCAAAGTTA 400 

TCGACCTGAT GTGTCCGTTC GCTAAGGGCG GTAAAGTTGG TCTGTTCGGT 45 0 

40 GGTGCGGGTG TAGGTAAAAC CGTAAACATG ATGGAGCTCA TTCGTAACAT 500 

CGCGATCGAG CACTCCGGTT ACTCTGTGTT TGCGGGCGTA GGTGAACGTA 550 

CTCGTGAGGG TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTATC 600 

GACAAAGTAT CCCTGGTGTA TGGCCAGATG AACGAGCCGC CGGGAAACCG 65 0 

TCTGCGCGTT GCTCTGACCG GTCTGACCAT GGCTGAGAAA TTCCGTGACG 70 0 

45 AAGGTCGTGA CGTTCTGCTG TTCGTTGACA ACATCTATCG TTACACCCTG 7 50 

GCCGGTACGG AAGTATCCGC ACTGCTGGGC CGTATGCCTT CAGCGGTAGG 800 

TTATCAGCCG ACCCTGGCGG AAGAGATGGG CGTTCTGCAG GAACGTATCA 85 0 

CCTCCACCAA AACTGGTTCT ATCACCTCCG TACAGGCAGT ATACGTACCT 900 

GCGGATGACT TGACTGACCC GTCTCCGGCA ACCACCTTTG CGCACCTTGA 950 

50 CGCAACCGTG GTACTGAGCC GTCAGATCGC GTCTCTGGGT ATCTACCCGG 1000 

CCGTTGACCC GCTGGACTCC ACCAGCCGTC AGCTGGACCC GCTGGTGGTT 105 0 

GGTCAGGAAC ACTACGACAC CGCGCGTGGC GTTCAGTCCA TCCTGCAACG 1100 

TTATCAGGAA CTGAAAGACA TC ATCGC CAT CCTGGGTATG GATGAACTGT 115 0 

CTGAAGAAGA CAAACTGGTG GTAGCGCGTG CTCGTAAGAT CCAGCGCTTC 1200 

55 CTGTCCCAGC CGTTCTTCGT GGCAGAAGTA TTCACCGGTT CTCCGGGTAA 12 5 0 

ATACGTCTCC CTGAAAGACA CCATCCGTGG CTTTAAAGGC ATCATGGAAG 13 00 

GCGAATACGA TCACCTGCCG GAGCAGGCGT TCTACATGGT CGGTTCCATC 13 5 0 

GAAGAAGCTG TGGAAAAAGC CAAAAAACTT TAA 13 83 
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2) INFORMATION FOR SEQ ID NO: 67 0 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1410 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(B) STRAIN: NCTC 11638 „ 

(C) ACCESSION NUMBER: extracted from AF004014 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 670 



ATGAAAGCGA 
TGTGGAGTTT 
ATTTTGAAGT 
CATTTGGGTG 
CTTAGTGCGT 
CTGTGGGCGA 
ATTGATAATT 
AAAAGCCCCT 
CCGGTATTAA 
GTAGGCTTGT 
GCTTATCCAC 
GTGTGGGGGA 
-GAAGGGGGCG 
GCCACCAGGT 
AGTATTTCCG 
ATCTTTAGAT 
TATCCCTTCA 
AACTTCAAGA 
CAAGCGGTGT 
GGTGTTTGCG 
AAAAAGGGAT 
TTAAGCCCTC 
CCAGCAAGTT 
TGGGATTGGA 
AGAAAAATTG 
TACAGGAAGT 
TTGGAGGGAT 
TACATGGTGG 
AAATTC CTAA 



TGGAAGGTAA 

GAATCCTATC 

TAATGGCGTT 

GTAATCGGGT 

AACCAAGCCG 

AGAAGTGTTA 

TAGAGCCGCT 

AGTTTTGAGC 

AGTCATTGAC 

TTGGTGGGGC 

AATGTGGCTT 

GCGCACCAGA 

TTTTAGACAA- 

GCAAGGAATC 

TGATGAAAAG 

ACGCTCAAAG 

GCGGTGGGGT 

GCGTATCGCT 

ATGTGCCAGC 

CATTTAGATG 

TTATCCGGCG 

AAATGATTGG 

TTGCAAAAAT 

CGAATTGAGC 

AGAAGTTTTT 

CCCGGTAAGT 

TTTAGAGGGC 

GCAGC ATTC A 



AATCATTCAG 
TGCCGGCGAT 
CAAAAATCTT 
GCGAGCGATT 
TCAAAGCTCG 
GGGCGTATTT 
TAAGCCGTCC 
AGCAAAGCAC 
TTGCTCGCGC 
TGGCGTAGGC 
ATAAGCATAA 
GAAGGGAACG 
-AGTTGGGT-TG- 
GCATCGCATT 
GGCTTAGATG 
CGGTGCGGAA 
ATCAGCCCAC 
TCCACTAAAA 
AG ACGAC TTG 
CG AC TACGGT 
GTGGATCCTT 
CGAGAAGCAC 
ACAAGGATTT 
GAAGAGGATA 
ATCCCAGCCG 
ATGTGACTCT 
AAATACGATC 
AGAGGTTTTA 



GTTTTAGGCC 
TTTTGAAGCA 
TAGTTTTAGA 
GCTATGGATA 
TGGCAAAATG 
TTAATGTTGT 
TTAACTTGGC 
TAAAACAGAA 
CTTATTCTAA 
AAAACGGTGA 
CGGGTATTCG 
ATCTGTATTT 
-TGGTATGGGC 
CACCGGCTTG 
TGTTGATGTT 
ATGAGCGCGC 
GCTAGCCGGG 
ATGGCTC TAT 
ACTGACCCAG 
GTTGAATAGA 
TGGATTCCAC 
TATGAAATCG 
GCAAGATATT 
AAAAAACGGT 
TTTTTTGTGG 
CCAAGAGACT 
ACATTCCTGA 
G AAAAAGC T A 



CGGTGGTAGA 5 0 

CTAGACATTA 100 

GGTGGCAGCC 150 

TG AC AG AAGG 200 

ATTGAAGTGC 250 

GGGCGAGAGC 300 

CC ATTC AC AG 350 

ATGTTTGAAA 400 

GGGCGGTAAA 450 

TCATTATGGA 5 00 

GTGTTTGCAG 550 

TGAGATGAAA 600 

AAATGAATGA 65 0 

ACGATGGCGG 7 00 

TATTGATAAC 7 50 

TATTAGGCCG 80 0 

GAAATGGGGA 850 

CACTTCGGTT 9 00 

CCCCTGCTTC 950 

AAGATCGCTG 1000 

TTCAAGGATT 1050 

CCACCGGTAT 1100 

ATTGCGATTT 1150 

TG AAAGGGC C 12 00 

CTGAAGTGTT 1250 

TTAGAGGGCT 13 00 

AAACGC GTTT 1350 

AAAACATGAA 1400 
1410 



2 ) INFORMATION FOR SEQ ID NO: 671 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1401 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium acetobutylicum 
<B) STRAIN : DSM 792 

(C) ACCESSION NUMBER: extracted from AF101055 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 671 

ATGCCAGAAC ATGTAGGTAA AATTGTTCAG GTAATAGGAC CTGTTGTGGA 50 

TATTAAATTT GATGCAGAGA ACCTTCCTGA CATCTATAAT TCCATAGAAA 100 

10 TAGATATGGG AGATAATAAA AAACTCATTG CTGAAGTTGA ACAACATGTA 150 

GGAGATGACA TAGTAAGAAC AATAGCAATG GAAGGT AC TG ACGGATTAAA 2 00 

AAGAGGAATG GAAGCAGTTA ACACTGGTAA ACCAATATCT GTACCAGTTG 250 

GAGAAAATGT TTTAGGACGT CTTTTTAATG TTTTAGGTCA GACAATAGAT 3 00 

GAAGCAGGAG ACATGAATGC TGATAAGTAT TATCCAATTC ATAGACCAGC 35 0 

15 TCCAACCTTT GAAGAACAAT CAGTTCAACC AGAAATGTTT GAAACAGGTA 4 00 

TTAAGGTTAT AG ATTT AC TT GCTCCATATC AAAAGGGTGG AAAAATCGGT 45 0 

TTGTTCGGTG GTGCCGGTGT TGGTAAAACA GTTCTTATTC AGGAACTTAT 500 

AAATAATATA GCAAAAGAAC ACGGTGGATT ATCAGTATTC ACAGGTGTTG 55 0 

GAGAAAGAAC AAGAGAAGGT AATGACCTTT ATTATGAAAT GAAAGATTCA 6 00 

20 GGAGTTATAA ATAAAAC AGC TCTAGTATTT GGTCAGATGA ATGAACCACC 650 

TGGCGCTAGA ATGAGAGTTG CTTTAACAGG ACTTACAATG GCTGAATATT 7 00 

TTAGAGACAA AGGTCAAGAT GTGCTTCTAT TTATAGATAA TATATTCAGA 7 50 

TTTACACAAG CTGGTTCAGA GGTTTCAGCT TT AC TTGGT A G AATACC TAG 800 

TGCCGTTGGT TATCAGCCAA CACTTGCAAA TGAAATGGGT GCTCTTCAAG 85 0 

25 AGAGAATAAC ATCAACAAAA CAGGGTTCAA TCACATCCGT TCAGGCTGTA 900 

TATGTTCCTG CTGATGACCT TACAGACCCA GCTCCAGCAA CAACATTTAC 950 

GCATCTTGAT GCAACAACAG TTCTTTCAAG AGAAATATCA AACTTAGGAA 1000 

TATATCCTGC AGTTAGTCCT CTTGAATCAA C TTC AAG AAT ACTTGATCCA 1050 

AGAATTGTTG GAGAAGAGCA TTATGAAGTT GCTAACAAGG TTAAACATAT 1100 

30 ACTTGAAAGA TATCAAGAAC TTCAAGATAT CATAGCTATA CTTGGTGTTG 1150 
."ATGAACTTTC AGATGAGGAT AGATTGTTAG TTGGAAGAGC AAGAAGAGTA " 12 00 

CAGAGATTCT TATCTCAAGC TTTTAGTGTT GCTGAACAAT TTACAGGAAT 12 5 0 

GAAAGGTCAG TTTGT AC C TG TAAAAGATAC TATAAGAAGT TTTAAAGAAA 13 00 

TATTAGATGG TAAGTGTGAT GATCTTCCAG AAGCTGCATT TTTATTTGCA 13 50 

3 5 GGAACAATAG AAGATGTAAA AG AAAAAGC T AAAAAAATGA TGGAAAGCTA 1400 

A 1401 



40 2 ) INFORMATION FOR SEQ ID NO: 672 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1509 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cytophaga lytica 

(B) STRAIN: DSM 2039 

(C) ACCESSION NUMBER: M22535 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 672 

ATGTCTAAAG TTACAGGTAA AGTTTCCCAA ATT ATTGGC C CAGTTATAGA 50 

TGTGGAGTTC CAAGCAGGGG TAGATCTTCC AAAAATTTAT GATTCATTAG 100 

AAATTAAAAA AGC AG ATGG A TCAATTTTGG TTTTGGAAGT ACAATCACAC 150 

60 ATTGGTGAGA ACACAGTAAG AACTATATCT ATGGATTCTT CTGATGGTTT 200 

583 



CA 02307010 2000-05-19 



AAGTCGTGGA GCAGAGGTTA ATGCAACAGG 
TTGGAGATGA CGTTTACGGA CGTTTATTTA 
GATGGTCTTG GGAATTTACC AAAATCTGGT 
CAGAGAGGCA CCAAAATTTG AAGACTTATC 
TTACAGGTAT TAAAGTAATT GACCTTATTG 
AAGATTGGTT TATTTGGAGG TGCCGGAGTA 
GGAATTAATT AACAACATTG CAAAAGGTCA 
CTGGTGTAGG TGAGCGTACT CGTGAGGGTA 
TTAGAGTCTG GTATTATTAA ATACGGAGAT 
AGAAGGTGGT TGGGATTTAT CTAAAGTTGA 
CTAAAGCAAC CTTTGTATTT GGACAAATGA 
GCACGTGTTG CATTATCTGG TTTAAC TATT 
AGCAGGTGAA GGTCAAGGTA AAGATGTACT 
TCCGTTTTAC ACAAGCTGGT TCTGAGGTAT 
CCATCTGCGG T AGGTTAC C A ACCAACATTA 
GCAAGAGCGT ATTACATCAA CAAAAAGAGG 
CGGTTTACGT ACCTGCGGAT GATTTAACGG 
TTTGCTCACT TAGATGCAAC AACGGTATTG 
AGGTATTTAC CCAGCGGTAG ATC C ATTAG A 
CTCCAGAAAT TTTAGGAAAA GATCACTACT 
GAGTTGTTAC AACGTTATAA AGAATTACAA 
TATGGAAGAA TTATCTGAGG AAGATAAAAT 
GTGTACAACG TTTCTTATCT CAGCCTTTCC 
GGTC TTAAAG GTGTTTTAGT AGATATCAAG 
TATGATTATG GATGGTGAAT TAGATCACTT 
TTAAAGGTAC TATTGAAGAA GCTATAGAAG 
GAAGCATAA 



AAGCGCTATA CAAATGCCAG 250 

ACGTAATTGG AG AC GC TATT 3 00 

AAAGATGGTT TGCCAATACA 350 

TACTTCTACA GAAGTATTAT 400 

AGCCTTATGC AAAAGGTGGT 4 50 

GGTAAAACAG TATTAATTCA 500 

CGGTGGACTT TCTGTATTTG 550 

ACGATTTACT ACGTGAGATG 600 

GACTTTATGC ACTCTATGGA 650 

TAAATCTGTA ATGAAAGATT 7 00 

ATGAGCCACC AGGAGCACGT 7 50 

GCAGAATATT TCCGTGATGG 800 

TTTCTTTGTG GATAACATTT 850 

CTGCATTACT TGGTCGTATG 9 00 

GCAACAGAAA TGGGTGCTAT 950 

TTCTATTACA TCTGTACAGG 1000 

ATCCAGCACC GGCAACTACC 1050 

TCTCGTAAAA TTGCAGAGTT 1100 

TTCTACTTCT AGAATCTTAG 1150 

CTTGTGCACA AC GTGTAAAA 1200 

GATATTATTG CTATCCTTGG 1250 

GGCAGTTGGT AGAGCAAGAC 13 00 

ACGTAGCAGA GCAATTTACA 13 50 

GATACTATTA AAGGATTTAA 1400 

AC C AG AATCT GCATTTAACC 1450 

CAGGAGAAAA AATGC TTGCT 1500 

1509 



2 ) INFORMATION FOR SEQ ID NO: 673 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ehrlichia risticii 

(B) STRAIN: HRC-IL 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 673 

CCTAAAATAT ATGATGCATT ATATGTAAAA CTAGATAATG AAAATTTGTG 50 

TTTAGAAGTT TCACAAATTA TTGGAGATAA TGTTGTTAGA TGTATTGCAA 100 

TGGGAGCTAC TTATGGATTA AATCGTGGTT TAGAAGTAGT TTGTTCAGGA 150 

AATCCAATTC AGGTTCCTGT AGGTGAACAA GTTTTAGGTA GAATGTTTAA 2 00 

TGTTGTTGGT AAAACAATTG ACAATCTTGA ATCTTTAGAT GATAAAAATA 2 50 

TAAAAATGAT GCCAATTCAT CGAAATCCAC CATCATTTGA AGAGCAATCC 3 00 

AATGAAATTG AAATTTTTGA AACAGGCATT AAAGTTATTG ATTTATTAAT 3 50 

TCCATATGCT AAAGGTGGTA AGATTGGATT ATTTGGTGGA GCAGGGGTTG 400 

GGAAAACGGT TCTTGTTCAA GAATTAATTC ACAATATCGC AAAAGGTCAT 450 

GGTGGTCTAT CTGTTTTTGC TGGAGTTGGT GAAAGAACTC GTGAAGGTAA 500 

TGAC TTGT AT TATGAAATGA TTGAAGGTGG AGTTATAGAT AAAACAGCCT 550 

TAGTGTTTGG GCAAATGAAT GAACCTCCTG GCGCAAGAAT GCGCGTAGCA 600 

TTAACTGCTT TAACAATGGC TGAATATTTC CGTGATGTTC AAAACCAAGA 650 

TGTTTTGTTA TTTATTGATA ATATCTTTAG ATTTACACAA GC TGGTAGTG 7 00 
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AAGTTTCAGC ATTATTAGGA AGAATGCCAT CTGCTGTTGG TTATCAACCA 7 50 

ACTTTGGCAT ATGAAATGGG ATTGTTACAA GAAAGAATCA CTTCCACTAA 800 
AAGTGGTTCT ATAACATCT 819 



2 ) INFORMATION FOR SEQ ID NO: 674 

{ i ) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 840 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Vibrio cholerae 

(B) STRAIN: ATCC 25870 



20 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 674 



AGAGCGAAGT ACCAAGTGTT TACGATGCTC TGAATGTTGT GGATTCCAAA 50 

GAACGTCTGG TTCTGGAAGT TCAACAGCAG CTAGGCGGTG GCGTGATTCG 100 

25 CGCTATCGTT ATGGGTAGCT CGGATGGTTT ACGTCGTGGA ATGACAGTAC 15 0 

AAAAC AC TGG CGCTCCAATT TCAGTACCAG T AGGT AC T AA AACCCTAGGT 2 00 

CGTATCATGA ACGTGCTTGG TGATGCGATT GACGAACGTG GCGACATTGG 2 50 

CGCAGAAGAG GTGTACTCGA TTCACCGTCC TGCTCCAAGC TACGAAGAAC 300 

AGTCTAGTGC AACTGAACTT TTGGAAACGG GTGTTAAGGT TATCGACCTG 3 50 

3 0 ATCTGTCCGT TTGCGAAAGG C GGT AAAATC GGTCTGTTCG GTGGTGCGGG 400. 

TGTAGGTAAG— ACCGTTAACA TGATGGAACT -TATCAACAAC ATCGCGCTAC 450 

AGCACTCAGG TTTGTCAGTA TTTGCTGGGG TAGGTGAGCG TACTCGTGAG 500 

GGTAACGACT TCTACCACGA AATGCAGGAA GCGGGCGTTG TAAACGTTGA 550 

AC AAC C AG AA CTGTCGAAAG TAGCGATGGT TTACGGTCAG ATGAACGAGC 600 

35 CACCAGGCAA CCGTCTGCGT GTAGCAC TGA CTGGTCTGAC TATGGCGGAA 65 0 

AAGTTCCGTG ATGAAGGCCG TGACGTACTG CTGTTTATCG ACAACATCTA 700 

CCGTTACACC CTAGCGGGAA C GGAAGTATC TGCTCTGCTT GGCCGTATGC 75 0 

CTTCAGCGGT AGGTT AC C AA C C AAC AC TGG CTGAAGAGAT GGGTGTTCTG 800 

CAAGAACGTA TCACGTCAAC CAAAAAAGGT TCTATCACCT 840 

40 

2 ) INFORMATION FOR SEQ ID NO: 675 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Vibrio cholerae 
55 (B) STRAIN: ATCC 25870 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 675 

CGGCGGTATC CTAGTTGTAG CGGCAACTGA CGGTCCAATG CCACAAACTC 50 
60 GTGAGCACAT CCTGCTGGGT CGCCAAGTAG GTATTC CTTA CATCATCGTA 100 
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TTCATGAACA AGTGTGACAT GGTTGACGAT GAAGAGCTTC TAGAGCTGGT 150 

Igaga?g^aa gttcgtgagc tgctgtctga gtacgatttc ccaggtgatg 200 

ACCTGCCAGT AATCCAAGGT TCAGCACTAG GCGCGCTAAA CGGCGAAGCA 250 

?agtgggaag CGAAGATTGT TGAGCTAGCA GAAGCACTGG ATACTTAYAT 3 00 

* SSSSgcS GAGCGTGCAG TAGACATGGC ATTCCTGATG CCAATCGAAG 350 

SctItTCTC AATCCAAGGT CGTGGTACAG TAGTAACTGG CCGTATCGAG 400 

£SSSS£ TGAAAGTGGG TG AC GAAGT A GCGATCGTTG GTATCAAAGA 450 

GACAGTAAAA ACGACCTGTA CAGGTGTAGA GATGTTCCGT AAGCTGCTTG 500 

ACGAAGGTCG TGCAGGTGAG AACGTAGGTG CACTRCTACG TGGTACTAAG 550 

m SgtgISgaag TAGAGCGTGG TCAAGTACTG GCGAAGCCAG GTTCAATCAC 600 

SSSJScT AAGTTCGAAT CAGAAGTATA CGTACTGTCA AAAGATGAAG 650 

??G?CCGTCA TACTCCATTC TTCAAAGGTT ACCGTCCACA GTTCTACTTC 700 

?OTA?aaSg aSgtAACAGG CAGCATTGAG CTACCAGARG GCGTAGAAAT 750 

GGTAATGCCA GGCGACAACG TGAAGATGGT TGTAGACCTG ATTGCACCAA 800 

15 TCGCGATGGA CGAAGGTCTA CGCTTCGC 



2) INFORMATION FOR SEQ ID NO: 676 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1298 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Leishmania enriettxi 

— (B) * STRAIN : ATCC 50120 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67 6 

35 CTTCTCGGAT GGCGTGCCGC CGGTGCTGAC GGCGCTGGAC GTGACGGAGG 50 

AACTCGGGCG CGACGAGCCG CTGACGCTAG AGATCGTGCA ACACTTGGAC 100 

??SScACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TACTGAAGCT 150 

gaaStcSaag GTCGTGTCGA CCGGCGGGAA CATCTCTGTG CCTGTGGGCC 200 

GCGAGACGCT TGGCCGCATC TTCAACGTAC TGGGCGACGC GATTGACCAG 250 

40 CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CGGAGGCGCC 300 

Saagctcgcg GACCAGGCGG CGGAGGACAC GATCCTGACG ACCGGCATCA 350 

AAGTGATCGA TCTGATCCTG CCGTACTGCA AGGGCGGCAA GATCGGTCTG 400 

?T?GGCGG?S CCGGTGTGGG GAAGACTGTG ATCATCATGG AGCTGATCAA 450 

CAACGTCGCG AAGGGCCACG GTGGCTTCTC CGTGTTCGCC GGCGTTGGGG 500 

45 AGCGCACCCG CGAGGGGACG G ATCTGT AC C TGGAGATGAT GCAATCGAAG 550 

5?Satcgacc TGAAAGGTGA GTCGAAGTGT GTGCTTGTGT ACGGGCAGAT 600 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGCCGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 700 

Ga2aaSaTCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CAGCGCTGCT 750 

so gSg^cgcatc CCCGCCGCTG TGGGCTACCA ACCGACGCTT GCGGAGGATC 800 

?CG^CAt5?t GCAGGAGCGC ATCACGTCGA CGACGAAGGG GTCGATCACG 850 

TCTGTGCAGG CGGTGTACGT GCCTGCGGAT GATATCACGG ATCCCGCGCC 900 

gg?ga?Saca TTCTCGCACC TGGACGCGAC GACGGTGCTG GACCGCGCGG 950 

TGGCGGAGTC TGGCATCTAC CCCGCGGTCA AC C C ACTGG A GTGCGCGTCG 1000 

55 ?G?AT?ATGG ACCCTGATGT GATCGACGTG GATCACTACA ACGTTGCGCA 1050 

GGACATTGTT CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

2gStc?ttS TAtSgACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 1150 

CGTGCGCGCA AGGTGACCCG GTTCCTGTCA CAGCCGTTCC AGGTTGCGGA 1200 

GGTGTTTACT GGCATGACAG GCCATTACGT ACAGCTGAGC GACACGGTGG 1250 

60 AGTCGTTCTC CGGTTTGCTG ATGGGGTCGT ACGACCAGAT TCCGGAGA l^a 
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2 ) INFORMATION FOR SEQ ID NO : 677 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1083 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Babesia mlcrotti 

(B) STRAIN: Persing-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 677 

20 CAAGCTCAAG TCTGAGCGTG AGAGAGGTAT TAC TATTGAC ATTACTCTCT 50 

GGAAATTTGA GACCCAGAAA TACGAGTACA CTGTCATAGA CGCACCTGGT 100 

CATCGTGACT TTATCAAAAA TATGATTACT GGGACTTCAC AAGCCGACGT 150 

TGCTATGCTT GTCGTTCCTG CTGAATCTGG CGGATTCGAG GCTGCTTTTT 2 00 

CCAAAGAAGG TCAGACCCGT GAACACGCCT TACTAGCCTT C AC AC TTGGC 250 

25 GTCAAACAGA TGATTGTTGC TATTAACAAA ATGGATTCTT GTCAGTACAA 3 00 

GGAGGATCGT TATATGGAAA TTTTCAAGGA AGTACAGCAG TACTTGAAGA 3 50 

AGGTGGGTTA CAAAGTTGAA AGCGTGCCGT TTGTTGCTAT TTCAGGATTC 400 

CACGGTGACA ACATGGTTGA AAAATCT AC T AACATGCCTT GGTATAAGGG 450 

TAAGACCCTC GTAGAGGCAC TTGATCAAAT GGAGCCTCCA AAACGTCCGG 500 

3 0 TCGAAAAACC TCTTAGATTG CCCCTGCAGT CAGTCTATAA AATTGGAGGT 55 0 

ATTGGTACGG TACCAGTCGG AAGGGTCGAA ACAGGACAAC TGAAAGCAGG 600 

AATGATCATT ACTTTTGCCC CCACTGGTTT GACCACTGAA TGTAAATCTG 65 0 

TTGAAATGCA TCACGAGGTT GTGGAAGTGG CTAGCCCCGG TGATAACGTT 7 00 

GGATTTAATG TCAAGAATGT GTCTGTTAAG GATATTAAGA GAGGAAATGT 7 50 

35 GGC TTCGG AT TCGAAAAATG ACCCAGCCAA GGAAGCTACC TCTTTCTCTG 800 

CACAAGTCAT TGTACTCAAT CACCCTGGTA CCATCAAGGC CGGTTACTCA 85 0 

CCTGTGGTTG ATTGCCATAC TGCCCACATT GCTTGCAAAT TCGAATCTCT 900 

AGACACTAGG ATTGACAAGC GTACTGGCAA GACTTTGGAA GAAAATCCTA 950 

AGACTATTAA GAATGGTGAC GCTGCCATGG TGACTATGAA ACCAAATAAA 1000 

40 CCCATGGTTG TGGAAACTTT CACCGACTAC GCCCCGTTGG GCCGGTTCGC 1050 

CGTGC GTGAT ATGCGCCAAA CCGTTGCCGT CGG 1083 



45 2 ) INFORMATION FOR SEQ ID NO: 678 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 551 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus neoformans 

(B) STRAIN: Lev- 12 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 678 
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TGTGCTCATT CAGGAGCTCA TTAACAACAT 
ACTCTGTCTT CACTGGTGTC GGTGAGCGAA 
TACCACGAAA TGAGGGAGAC TGGTGTTATC 
GGTCGCTCTT GGTGAGTTCT TTTTTTTCTT 
5 GTGGGCCCTG ACTAAAACTG TTTCTTCCAG 
CCCCCTGGAG CCCGTGCCCG AGTTGCCCTT 
GTACTTCCGT GACGAGGAAG GCCAGGATGT 
TTTTC CG ATT CACCCAGGCC GGTTCCGAGG 
ATCCCCTCTG CCGTCGGTTA CCAGCCCACT 
10 TATGCAGGAG CGAATGTAGG TTGCATTCTC 
TGACTTTTTT TTTCTAGTAC CACCACCAAG 



TGCCAAGGCT CACGGTGGTT 50 

CTCGTGAGGG TAACGACTTG 100 

AACCTTGAGG GTGACTCCAA 150 

CAGGCTAATT AGTCGATGAC 2 00 

TCTTCGGTCA GATGAACGAG 2 50 

ACTGGTTTGA CCATTGCCGA 3 00 

GTTGC TTTTC ATTGACAACA 3 50 

TGTCTGCCTT GCTCGGTCGT 400 

CTTTCCACCG ACATGGGTGG 450 

TGTGATTTTA CGGCAAGCCT 500 

AAGGGTTCCA TTACCTCCGT 550 

551 



15 „ n 
2 ) INFORMATION FOR SEQ ID NO: 679 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 552 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus neoformans 

(B) STRAIN: ATCC 44104 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 679 

TGTGCTCATT CAGGAGCTCA TTAACAACAT TGCCAAGGCC CACGGTGGTT 50 

ACTCCGTCTT CACCGGTGTC GGTGAGCGAA CTCGTGAGGG TAACGACTTG 100 

TACCACGAAA TGAGGGAGAC TGGTGTTATC AACCTTGAGG GTGACTCCAA 150 

35 GGTCGCTCTT GGTGAGTTCT TTTTTTTGGG AGCTAATTAG TCGATGACGT 2 00 

GGGCCCTGAC CAAAACCGTT TCTTTCAGTC TTCGGTCAGA TGAACGAGCC 250 

CCCTGGAGCC CGTGCCCGAG TTGCTCTTAC TGGTTTGACC ATTGC CG AG T 300 

ACTTCCGTGA CGAGGAAGGC CAGGATGTGT TGCTTTTCAT TGACAACATT 350 

TTCCGATTCA CCCAGGCCGG TTCCGAGGTG TCTGCCTTGC TCGGTCGTAT 400 

40 CCCCTCTGCC GTCGGTTACC AGCCCACTCT TTCCACCGAT ATGGGTGGTA 450 

TGCAAGAGCG AATGTAAGTT GCATTTTTTG TGATTTTACG GCAAGTCTTG 500 

AC TT AC ATTT TTTTCTAGCA CCACCACCAA GAAGGGTTCC ATTACCTCCG 550 

552 

TC 



45 



2 ) INFORMATION FOR SEQ ID NO: 680 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 1018 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cunnlnghamella bertholletiae 

(B) STRAIN: ATCC 42115 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 680 

TCTCCCTGCT ATTTTAAACG CTCTTGAAGT TAAGGATCAT GCTGGTGGTC 5 0 

GTCTTGTTCT TGAAGTTGCT CAACACTTGG GTGAAAACAC TGTACGTACT 100 

5 ATTGCTATGG ATGGT AC TG A AGGTAAGTTT ATTTTTAGAT GATCATAAAT 150 

AATTGATCAT AATGATAAAA AAAAAAGAAG AAGAAGAACA GGATGTATAT 2 00 

AATGGTTAAT AAATAATATT TTCATATTGK ATATAACTAT TTAATC TGTT 25 0 

TTTTTTTCTT CATGATTATA TATATATATG TMCTAATATC TAATATGAAC 300 

CTTTTTTATA AAATTAATCA GGTCTTGTCC GTGGTCAAAA GGTTGTTGAT 3 50 

10 ACTGGTGCTC C TATTAC CAT TCCTGTTGGT AAGGAAGTTC TTGGTCGTAT 400 

CATCAACGTT ATTGGTGAAC CCATTGATGA ACGTGGTCCT ATTAAGGCCA 450 

AGTCTCACCG TGCTATTCAC GCTGAAGCTC CAGAATTCGT TGATCAATCT 5 00 

CCTACTCCCG AAATTCTTGA AACTGGTATT AAGGTTGTCG ATTTATTAGC 550 

TCCTTATGCT CGTGGTGGTA AGATTGGTCT TTTCGGTGGT GCTGGTGTAG 600 

15 GTAAAAC TGT CTTGATTCAA GAACTTATTA ACAACATTGC T AAAGC C CAT 65 0 

GGTGGTTACT CTATTTTCTG TGGTGTTGGT GAACGTACTC GTGAAGGTAA 700 

CGATTTATAC CACGAAATGA TGGAAACTGG TGTCATTAAA CTTGAAGGTG 750 

ACTCCAAGTG TGCTCTTGTA TTCGGTCAAA TGAACGAACC TCCTGGTGCT 800 

CGTGCCCGTG TTGCTTTAAC TGGTTTAACC ATTGCTGAAT ACTTCCGTGA 850 

20 TGAAGAAGGT CAAGATGTGT TACTTTTCAT TGATAACATT TTCCGTTTCA 900 

CTCAAGCTGG TTCTGAAGTA TCTGCCCTTT TAGGTCGTAT TCCATCTGCT 950 

GTAGGTTACC AACCCACTTT ATCT AC TGAT ATGGGTGGTA TGCAAGAACG 1000 

TATTAC TACT ACCAAGAA 1018 



25 



2) INFORMATION FOR SEQ ID NO: 681 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 681 
GGISSITTYG GIISIGGIAA RAC 23 

40 

2 ) INFORMATION FOR SEQ ID NO: 682 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 682 
5 5 GTIACIGGYT CYTCRAARTT ICCICC 2 6 



60 



2 ) INFORMATION FOR SEQ ID NO : 683 

589 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 683 
GTIACIGGIT CISWIAWRTC ICCICC 



15 2 ) INFORMATION FOR SEQ ID NO: 684 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3267 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida tropicalis 

(C) ACCESSION NUMBER: M64984 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 684 

30 ATCfCTGGTG CTTTAGAAAA CGCAAGAAAA GAAATTAAAC GTCTTTCATT 50 

££tga?aS aatcaatccc AATATGGTCA AATCTATTCT GTTTCCGGTC 100 

cSSttgttat TGCCGAAAAC ATGATTGGAT GTGCCATGTA CGAATTGGTT 150 

aSSggtc atgataattt AGTTGGGGAA GTTATTAGAA TTAATGGTGA 200 

35 tI^gcSac? SSSaott? ATGAAGAAAC tgcaggggtc ACTGTTGGTG 250 

ATCCAGTTTT AAGAACTGGT AAACCATTAT CTGTTGAATT AGGTCCTGGT 300 

TTAATGGAAA CTATTTATGA TGGTATTCAA AGACCTTTAA AAGCCATTAA 350 

Iga^gaatcc CAATCTATTT ATATCCCAAG AGGTATTGAT GTTCCTGCTT 400 

Jatcaa^aac tgttcaatat GATTTCACTC CAGGTCAATT GAAAGTTGGT 450 

40 GATCATATCA CTGGTGGGGA CATTTTTGGT TCTATTTATG AAAACTCTTT 500 

ATTGGATGAC CATAAGATTT TGTTACCTCC AAGAGCAAGA GGT AC T ATT A 550 

C?T?TA?TCC TGAAGCCGGT TCTTATAATG TTGAAGAACC AGTTTTGGAA 600 

GTTGAATTTG ATGGTAAGAA ACATAAATAC TCTATGATGC ATACATGGCC 650 

AGTTAGAGTT CCAAGACCAG TTGCTGAAAA ATTGACTGCT GATCATCCAT 700 

45 £gSgac?S tcaa^Sagtc TTGGATTCTT TATTCCCATG TGTTCAAGGT 750 
TGTACTACTT GTATCCCAGG GGCTTTTGGT TGTGGTAAAA CTGTTATTTC 800 
tcStctttg TCCAAATTCT CCAACTCTGA TGTTATTATC TATGTTGGTT 850 
Sttcactaa AGGTACTCAA GTCATGATGG CTGATGGTGC CGACAAATCT 900 
a™aatS£ Jtgaagttgg TGACAAAGTC ATGGGTAAAG ATGGTATGCC 950 

50 AAGAGAAGTT GTTGGCTTAC CAAGAGGTTA TGATGATATG TACAAGGTTC 1000 

OTCAACTTTC TAGTACTAGA CGTAATGCTA AATCCGAAGG CTTGATGGAT 1050 

TTCACTGTTT CTGCTGATCA TAAACTTATC TTGAAAACTA AACAAGATGT 1100 

CAAgIStcCT ACACGTAAAA TTGGTGGCAA CACCTATACT GGTGTTACTT 1150 

TCTATGTOTT GGAAAAGACT AAGACTGGTA TTGAATTAGT TAAAGCCAAG 1200 

55 ACTAAAGTTT TCGGTCATCA TATCCATGGT CAAAATGGCG CTGAAGAAAA 1250 

AGC TGC T AC T TTTGCTGCTG GCATTGACTC TAAAGAATAC ATTGATTGGA 1300 

TCATTGAAGC TAGAGATTAT GTACAAGTTG ATGAAATTGT CAAGACCAGC 1350 

ISScAAA TGATCAACCC AGTTCATTTT GAATCTGGTA AACTCGGTAA 1400 

StSgTTACAC GAACACAAGC AAAACAAATC ACTTGCTCCA CAATTGGGTT 1450 

60 aS?g?tggg TACTTGGGCT GGTATTGGAA ATGTTAAATC TTCTGCTTTC 1500 
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ACCATGAACT 
CTCTTCAAAA 
ATGTCGCTGA 
GATGAAGCTG 
5 ATTGACTATC 
TGTGGAATGC 
GTCAAGAGTA 
ATCTTTGATT 
CCAATGGTTC 

10 AG AGGTC TTG 
TATTAAAGAT 
GTATTGTCAA 
GCACTTGCAA 
TTTGTTCAAC 

15 TTACTTTGGC 
TTGGTGCACA 
GGAATTCCCA 
TGAAACGTAC 
AG AGAAGC TT 

2 0 TC AAGGTAAG 

AAGCTTTGAG 
GGTTTCCCAG 
CGGTAAAGCC 
TTGTTGCTGC 
25 ACTTCTACTT 
GGCCCAAAGA 
AATACACCAA 
C C AC AATTG A 
GGAACAAGTT 

3 0 AGATTACTTT 

AATGGTTATT 
TATGATGAGA 
CCAATGGTGC 
CATGCTGTTT 
35 AGGTGAAAAA 
CTGAAGCTTC 



CCAAAGATGA 

TTGGGCATGA 

AAACGAAGAA 

GTGATTTCAC 

AATGTTCATG 

TTTGAAATCT 

TTCCTCAACA 

GCCGGTTTAG 

TATTGAAGCT 

TCAAGATTGC 

ACTCACATTG 

TTTGACTGGT 

GAAACCAAAC 

TTTGATTTGA 

TGAAGAAACT 

ACTGTGGTGA 

GAATTGTTTA 

CACTTTGGTT 

CTATTTATAC 

AATGTTTCTA 

AGAAATTTCT 

CTTATTTGGG 

ACTGCTTTGG 

TGTTTCTCCA 

TGGGTATTAC 

AAACATTTCC 

TGTTTTGAAC 

GAGACAAAAT 

GTTCAATTAG 

AGATGTTGCT 

CTTCATATGA 

GCATTTATTT 

TCAATGGTCT 

CTTCAGCTAA 

GAATTTGGAG 

AGAATAA 



TGTTAAATTA 

CTTGTTCTTC 

GAATTTTTCA 

TTTTGATGAA 

GTGCAGCTGC 

CTTGGTTTCA 

TATTGCTGTT 

TTGATGCTGC 

GTTGTTAGAA 

TCATTCTTTG 

ATGCTGCTGG 

GCTCCACTTG 

TCCAGTTGTC 

TCAAATCTGC 

GATCATCAAT 

ACGTGGTAAT 

CTGAAATTTC 

GCCAATACTT 

TGGTATTACA 

TGATTGCTGA 

GGTAGATTGG 

TGCTAAATTG 

GTTCACCAGA 

GCTGGTGGTG 

TCAAGTTTTC 

CATCTATTAA 

AAATACTATG 

TAGAGAAATT 

TTGGTAAATC 

AC C TTG ATT A 

TGCATTCTGT 

CATATTATGA 

AAATTAGCTG 

ATTCTTTGAA 

ATTTATTAAC 



GCTACAAGAA 

TACTGAATCC 

ATAACCTTGG 

TTTACCGATG 

AAGCAAGAAG 

GAGCCAAGTC 

GATGATATTG 

TGGTAATGTT 

CTTC TTTC AG 

GGTATTGAAT 

TGTTAGACAA 

CTGGTGTTCT 

AAATTTAC CA 

AAAAGAAAAC 

TCCTTTTATC 

G AG ATGGC TG 

TGGTAGAAAA 

CTAATATGCC 

TTGGCTGAAT 

TTCTTCTTCA 

GTGAAATGCC 

GCTTCTTTCT 

TAGAGTTGGT 

ATTTCTCTGA 

TGGGGGTTGG 

CACCAGTGTT 

ATTCCAACTA 

TTATCTAATG 

TGCATTGTCT 

AAGAAGATTT 

C C AATTTGG A 

TGAAGCACAA 

AAAGTAC TAG 

CCATCAAGAG 

CACTATCTCC 



TTATGAACTA 155 0 

GGTGAAC TC A 1600 

TGCTGAAAAG 165 0 

CTATGGATGA 17 00 

AACAATTTGT 17 50 

TACTGATATT 1800 

TTGTCAGAGA 1850 

GAAACCAAAT 19 0 0 

ACATGTCGCT 1950 

CATCTATTAA 2000 

GAATTTGCTT 2050 

TTC TAAATGT 2100 

GAGACCCAGT 2150 

TATTATGGTA 22 00 

CAACATGGCC 2250 

AAGTTTTGAT 23 00 

GAACCAATTA 23 50 

AGTCGCTGCC 2400 

ATTTCAGAGA 2450 

CGTTGGGCTG 2500 

TGCTGATCAA 2550 

ATGAGCGTGC 2600 

TCAGTTTCTA 265 0 

TCCAGTTACT 2700 

ATAAGAAATT 27 5 0 

TCTTATTCTA 2 800 

TCCAGAATTC 285 0 

CTGAAGAATT 29 00 

GATTCTGATA 2950 

CTTGCAACAA 3000 
"AG AC TTTTG A ' 3 05 0 

AAAGCAATTG 3100 

TGATGTTAAA 315 0 

GTCAAAAAGA 32 00 

GAAAGATTTG 3250 
3267 



40 2 ) INFORMATION FOR SEQ ID NO: 685 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1782 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 <vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus hirae 

(B) STRAIN: ATCC 9790 

(C) ACCESSION NUMBER: D17462 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 685 

TTGCAAATTG GAAAAATCAT AAAAGTCTCC GGTCCTCTCG TT ATGGC AGA 5 0 

AAATATGTCA GAAGCAAGTA TTC AAG AC AT GTGTTTAGTG GGAGATTTAG 100 

GAGTCATCGG CGAAATCATT GAGATGCGTC AAGATGTGGC GTCTATTCAA 150 

60 GTATATGAAG AAACTTCAGG AATTGGTCCC GGAGAACCTG TTCGTTC C AC 200 
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40 



45 



TOrPPAACCA CTATCTGTTG AGCTAGGACC AGGAATCATT TCACAAATGT 250 

TTGACGGGAT TCAAAGACCA CTGGATACAT TTATGGAAGT GACTCAAAGT 300 

AACTTCTTAG GACGTGGGGT CCAATTACCA GCTTTAGATC ATGAGAAACA 350 

ATT c TGGTTT GAAGCGACAA TCGAAGAAGG AACAGAAGTA AGTGC TGG AG 400 

5 SSSSSS GTACGTGGAT GAAACGAAGA TCATTCAGCA CAAAATCATG 450 

GTCCCTAATG GTATCAAAGG AACTGTACAA AAAATTGAAT CTGGATCATT 500 

TACGATCGAT GATCCGATTT GTGTGATCGA AACGGAACAA GGCTTAAAAG 550 

Igctgacgat GATGCAAAAA TGGCCAGTAC GTCGTGGTCG ACCAATCAAA 600 

CAAAAATTAA ATCCAGATGT ACCGATGATC ACCGGTCAAA GGGTCATTGA 650 

10 CACGTTTTTC CCAGTAACTA AAGGAGGAGC GGCAGCCGTT CCAGGTCCGT 700 

10 S^S GaISSaGTT GTGCAACACC AGATTGCTAA GTGGTCGGAC 750 
GTAGATCTAG TGGTTT ACGT TGGTTGTGGG GAACGAGGAA ATGAAATGAC 800 
PCATGTCGTC AATGAATTTC CTGAACTGAT CGATCCAAAT ACAGGCGAGT 850 
CTTTGATGGA ACGAACTGTG TTGATCGCTA ATACATCGAA CATGCCAGTA 900 
15 §SS5S»0 AAGCTTC TAT TTATACGGGA ATCACGATTG CCGAGTACTT 950 

CCGTGACATG GGGTATGATG TAGCAATCAT GGCAGATTCC ACTTCTCGTT 1000 

SgSgSgc ACTGCGTGAA ATGAGCGGAC GTTTAGAAGA AATGCCTGGT 1050 

GATGAAGGTT ATCCCGCTTA TCTGGGCTCT CGTCTAGCTG AATACTATGA 1100 

ACGTTCAGGA CGTGTCATTG CTCTAGGCTC TGACCAACGT GAGGGCAGTA 1150 

20 tcaSS^cat CAGTGCGGTT TCTCCTTCTG GTGGAGATAT CTCTGAACCA 1200 

GTGACTCAAA ATACCTTACG TGTGGTGAAG GTTTTCTGGG GATTAGATTC 1250 

CAAAAAAGAC ATTTTCC ATC GATTAACTGG ATCCAAAGTT 1300 

ACTCATTATA TTCAACAGAA GTTGGCAGAT ATATGGATCA AATCTTACAA 1350 

CAGGATTGGT CTGATATGGT AACTGAAGGC ATGCGGATCT TGCAAGAAGA 1400 

AGAACAATTA AATGAAATCG TGCGCTTGGT AGGGATCGAT TCGC TTTCTG 1450 

A?A^CG^TCG CTTGACCCTT GAAGTAGCAA AATCGATTCG AGAAGACTAT 1500 

TTACAACAAA ATGCTTTTGA TGATGTAGAT ACGTTTACTT CAAGAGAAAA 1550 

ACAATTCAAC ATGTTGAAAG TTATTTTGAC TTTTGGGAAA GAAGCTCGAA 1600 

AAGCCTTATC OTtSgGAGCG TACTTCAATG AAATCATGGA AGGTACAGTA 1650 

, n PPPGTCAGAG AACGCATTAG TCGGAGCAAG TATATTCCAG AAGAAGAGTT 1700 

30 AGTAGTATAA ATGAAGAAAT CAAAGAAACG ATCCAATTGA 1750 

TTGTTTCAGA AGGAGGGATG ACCGATGATT AA 

35 2 ) INFORMATION FOR SEQ ID NO: 686 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1781 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia pneumoniae 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 686 

50 CAGGGACATG TTATAGAAGC TTATGGAAAC TTGTTACGTG TACGCTTTGA 50 
cg5a?atgtt AGACAAGGTG AAGTTGCATA TGTCAACGTA GATAATACCT 100 
GGTTAAAAGC AGAAGTGATT GAAGTTGCTG ATCAAGAAGT CAAGGTTCAG 150 
GTATTTGAAG ATACACAAGG CGCGTGTCGA GGAGCTCTTG TTACGTTTTC 2 00 

AGGACATCTT TTAGAAGCCG AGTTAGGGCC TGGCTTGCTT CAGGGCATTT 250 
TCGATGGACT TCAAAATCGT CTTGAGGTGC TAGCTGAAGA TAGTTCTTTC 300 
TTGCAGAGAG GCAAGCATGT TAATGCTATT TCTGATCATA ATTTATGGAA 350 
TTATACTCCC GTAGCTTCTG TTGGGGATAC TTTAAGACGA GGAGATCTTC 400 
TAGGAACAGT ACCTGAAGGA CGATTTACTC ATAAGATTAT GGTTC CTTTT 450 

60 TCTTGCTTTC AAGAGGTTAC CCTGACTTGG GTAATTTCTG AAGGAACCTA 500 

592 
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TAATGCTCAT 
GTGCCTTTAC 
GGAGAGAAGA 
AGATACGCAA 
5 CTTTTGGTGC 
GCTGTAGATA 
TGTTGAGGTA 
AGTCTTTAAT 
GTGGCTGCCC 

10 CTATCGCCAG 
GATGGGCACA 
GGAGAGGAAG 
TGAGCGAGGA 
CTATATGTGG 

15 ACTCAATCTA 
ACGAGCTGAC 
CAAAATATTT 
TGGGGTGGTG 
AATCGGCAAG 

20 ACATGGAAAT 
CAGAACGCAT 
AGAGTTATTT 
ATAGTCCTGA 
AAGACATTAA 

25 AGAGGTCATA 
GATATGCAAA 



ACTGTGGTCG 

TATGGTGCAA 

TCCCTGCGCA 

ATTCCAGTAT 

AGGGAAAACA 

TTGTGATTTT 

TTACAAGAGT 

GCACAGAACA 

GAGAGTCTTC 

ATGGGACTAG 

AGCCCTTAGA 
CATTTCCTGC 

GGAGCTATCA 
TGCGGTGTCT 
CATTAGCTGT 
GCACGTAGGT 
GAACCAGGTA 
CTGTGAAAAA 
CGTATGGAAG 
CTACTTAAAG 
TCGATCCTGT 
TCATTAATCA 
TGATGCAAGA 
ATGGCCTGAA 
GTTAGACTGT 
CAATCTACAC 



CAAAAGCTCG 
AGATGGCCGA 
TAAGATTATG 
TGAAGGGGGG 
GTCTTACAAC 
GTGTGCGTGC 
TCCCTCATCT 
TGTATTATTT 
GATCTATTTA 
ATATTCTGCT 
GAGATTTCGG 
ATACCTGTCT 
CCACGAAAGA 
CCTGCAGGAG 
AGTCGGAGCG 
ATCCTTCAAT 
GGACAAATTT 
AGCAGCACAG 
TTGTCAGTGA 
GCAGAACTTT 
GGACTGTTAT 
GTCGTATTTT 
AGCTTTTTCC 
ATTTCTTTCA 
TGGAAAAAAC 
AAAAATAACT 



AGATGCTCAG 

TCAAACAAGC 

GATGTGGGTT 

AACTTTCTGT 

ACCATCTTTC 

GGAGAGCGTG 

TATCGACCCC 

GTAACACATC 

GGAGTGACGA 

TTTAGCTGAT 

GACGTCTTGA 

TC TAG AATAG 

TGGTTCTGAA 

GAAACTTTGA 

TTCTGTGGTC 

AGACCCTTTG 

TAGAAGAGAA 

TTTCTAGAGA 

AGAAGGGGTT 

ATGATTTTTG 

TGTCCTTTTG 

TGATGCTAAA 

TTGAGCTGCA 

GAGGAATATC 

AATGGTACAA 

G 



GGTAAAGAAT 5 50 

TTTTATTGAA 600 

TGCGAATCTT 650 

ACCCCAGGAC 7 00 

TAAGTACGCT 7 50 

CTGGTGAAGT 800 

CATACCGGAA 850 

ATCCATGCCT 900 

TTGCAGAATA 950 

TCTACATCCC 1000 

AG AAATCCC T 1050 

CTGCTTTTTA 1100 

GGATCTTTAA 1150 

AGAAC C AGTC 1200 

TTTC AAAAGC 12 5 0 

ATTTC TTGGT 13 00 

GGTTTC AGGC 13 50 

AAGGTTCAGA 1400 

TCTATGGAAG 1450 

TkATCTCCAG 1500 

AG AGAC AGAT 1550 

TTTGTTTTTG 1600 

G AGC AAGATT 1650 

ATGAGAGTAA 1700 

ATGGCGTAAG 17 50 
1781 



3 0 2 ) INFORMATION FOR SEQ ID NO: 687 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 58 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

{ D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Halobacterium salinarum 

(C) ACCESSION NUMBER: S56356 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 687 



ATGAGTCAAG CTGAAGCAAT CACTGACACC GGCGAAATCG AGAGCGTGAG 50 

CGGTCCGGTC GTGACCGCCA CGGGCCTCGA CGCGCAGATG AACGACGTCG 100 

TCTACGTGGG CGACGAGGGT CTGATGGGCG AGGTCATCGA GATCGAAGGC 150 

GACGTAACCA CCATCCAGGT CTACGAGGAG ACGTCCGGCA TCGGGCCGGG 200 

50 CCAGCCCGTC GACAACACGG GCGAACCGCT C AC CGTGG AC CTGGGCCCGG 25 0 

GGATGCTGGA CTCCATCTAC GATGGTGTGC AGCGTCCACT GGACGTCCTC 3 00 

GAAGACGAGA TGGGGGCGTT CCTCGATCGC GGTGTTG AC G CACCCGGCAT 350 

CGACCTCGAC ACCGACTGGG AGTTCGAGCC CACCGTCGAG GCGGGCGACG 400 

AGGTCGCGGC CGGCGATGTC GTCGGCACCG TCGACGAAAC GGTCAGCATC 450 

55 GAACACAAGG TTCTGGTGCC CCCCCGCAGT GACGGCGGCG AAGTCGTCGC 500 

CGTCGAATCC GGCACGTTCA CCGTCGACGA CACGGTCGTC GAGTTGGACA 550 

CCGGCGAGGA GATCCAGATG CACCAGGAGT GGCCGGTCCG CCGCCAGCGC 600 

CCCACCGTCG ACAAGCAGAC GCCGACGGAG CCGCTGGTGT CCGGCCAGCG 650 

CATCCTCGAC GGCCTGTTCC CGATCGCGAA AGGCGGGACG GCCGCGATCC 700 

60 CGGGGCCGTT CGGGTCCGGG AAGACGGTCA CCCAGCAGTC CCTCGCGAAG 750 
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TTCGCCGACG CGGACATCGT TGTCTACATC GGCTGTGGTG AGCGCGGCAA 800 

SSSSaCG gSgTCATCG AGGACTTCCC GGAGCTGCCC GACCCCCAGA 850 

cSgggaaccc GCTGATGGCC CGCACCACGC TCATCGCGAA CACGTCGAAC 900 

ATQCCGGTTG CCGGGCGTGA GTCCTGCATT TACACGGGAA TCACCATCGC 950 

5 ggag5a??ac CGCGACATGG GCTACGACGT ggcgctgatg GCCGACTCCA 1000 

2StcGCGGTG SgCGGAGGCC ATGCGGGAGA TCTCCTCGCG ACTCGAGGAG 1050 

ATGCCCGGCG AGGAGGGGTA TCCCGCGTAC CTGGCCGCCC GCCTCTCGGA 1100 

GTTCTACGAG CGCGCCGGCT ACTTCGAGAA CTTCAACGGG ACCGAGGGCT 1150 

CCATCTCGGT CATCGGTGCG GTGTCGCCGC CCGGCGGGGA CTTCTCCGAG 12 00 

10 CCGGTCACCC AGAACACGCT GCGCATCGTG AAGACGTTCT GGGCGCTTGA 1250 

CTCGGACCTC GCCGAGCGCC GGCACTTCCC GGCGATCAAC TGGGACGAGT 13 00 

CCTACAGCCT CTACAAGGAC CAACTCGACC CGTGGTTCAC GGACAACGTC 135 0 

ctSaCGACT GGGCCGAGCA GCGCCAGTCG GCGGTCGACA TCCTCGACGA 1400 

GGAATCCGAA CTCGAAGAGA TCGTGCAGCT CGTCGGGAAG GACGCGCTGC 1450 

15 CCGAGGACCA GCAGCTCACG CTGGAAGTCG CGCGGTACAT CCGCGAGGCG 1500 

TGGCTCCAGC AGAACGCGCT CCACGACGTG GATCGCTACT GCCCGCCCGA 1550 

JSgACGTAC GCCATCCTCT CCGGCATCAA GACGCTTCAC GAGGAGTCCT 1600 

^SgCGTT GGACGCCGGT GTGCCAGTCG AGGAGATCAC GTCGATCGAC 1650 

GCCGCGCCGC GCCTGAATCG TCTCGGCACG ACGCCCGACG ACGAGCACGA 1700 

20 GGCGGAGGTC GCGGAGATCA AACAGCAGAT TACCGAGCAG CTTCGGGAGC 17 50 
TCTACTGA 



25 2 ) INFORMATION FOR SEQ ID NO : 688 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3118 bases 

(B) TYPE: Nucleic acid 
( c ) STRANDEDNESS : Double 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



30 



35 



40 



45 



50 



55 



60 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Human 
(C) ACCESSION NUMBER: 



L09234 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 688 

GAATTCCGGC AGCTGACTAG TCTTGTGATT GGGGTCCTGG 
TCATTCCAAA TGACGAGCAC ATTGATAAAG ACGTCCGATG 
GTCCAAATTC GGCTTTGTTT TTGCCGTATC TGGACCTGTG 
AACGAATGGC CGGTTCTGCT ATGTACGAAC TGGTGCGTGT 
GAACTGGTCG GAGAGATCAT CCGGTTGGAG GGTGACATGG 
AGTATACGAA GACACCTCAG GTGTGACAGT AGGCGATCCC 
CAGGCAAGCC GCTGTCCGTG G AAC TGGGAC CCGGAATCAT 
TTCGACGGTA TCCAGCGACC GCTGAAGGAT ATCAATGAAC 
TATCTACATC CCGAAAGGTG TCAATGTGCC TGCCCTGAGT 
AGTGGG AC TT CAGTCCCGTC AGTGTCAAGG TTGGAAGCCA 
GGTGACCTGT ACGGTTTGGT CCACGAAAAT ACTCTGGTGA 
GCTGCTGCCG CCCCGTGCCA AGGGAACTGT C AC GT AC ATT 
GAAACTACAC AGTTGATGAT GTTGTCCTGG AGACAGAATT 
CGATCAAAGT TCACCATGCT GCAAGTGTGG CCTGTACGTC 
TGTTACAGAA AAGTTGC C AG CTAACTACCC CCTCCTTACT 
TGCTCGACTC CCTATTCCCG TGTGTCCAGG GTGGAACAAC 
GGGGCCTTCG GATGTGGCAA GACTGTAATA TCACAGTCTT 
CTCAAACTCC GATGTAATTA TCTATGTAGG TTGTGGTGAG 
AAATGTCAGA AGTACTCAGG GATTTCCCGC AGTTGTCGTT 
GGTGTGACTG AATCAATCAT GAAGAGAACA GCCCTGGTCG 

594 



GCTGATAAAA 50 

AGG AC CGGG A 100 

GTGACAGCTG 15 0 

CGGTTATTAT 200 

CAACAATCCA 250 

GTGCTGCGCA 3 00 

GGGCAGC ATC 3 50 

TGTCAAATAG 400 

CGCACTGCAC 450 

C ATT AC TGGT 500 

AACACAAGTT 550 

GCAGAACCTG 600 

TGACGGCGAG 650 

AGCCCAGGCC 700 

GGCCAGCGTG 7 50 

AGCTATTCCT 800 

TGTC AAAATA 85 0 

CGAGGTAATG 900 

GGAGATTGAT 950 

CAAACACATC 1000 
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AAACATGCCT GTGGCTGCTC GAGAAGCATC TATCTACACA GGTATTACAC 
TGTCAGAATA CTTCAGGGAC ATGGGTTACA ATGTATCCAT GATGGCTGAC 
TCAACTTCAC GATGGGCCGA AGCTCTTCGA GAAATCTCAG GTCGATTGGC 
TGAAATGCCT GCCGACAGCG GTTATCCCGC CTACCTAGGT GCACG AC TTG 
5 CCAGTTTCTA CGAGCGTGCC GGCCGTGTGA AGTGCTTGGG TAACCCAGAC 
AGGGAGGGCT CCGTGAGTAT AGTGGGCGCC GTGTCGCCGC CCGGTGGAGA 
CTTCTCAGAT CCCGTGACGA CGGCCACACT AGGTATCGTC CAGGTGTTCT 
GGGGTCTCGA CAAGAAACTT GCCCAGCGAA AGCACTTCCC ATCCATCAAC 
TGGCTCATCT CGTACAGTAA ATACATGCGT GCTCTGGATG ACTTCTACGA 

10 CAAGAATTTC C C AG AGTTTG TCCCACTGCG TACAAAGGTG AAGGAGATTT 
TGCAGGAGGA AGAAGACCTG TCTGAAATTG TGCAGTTGGT CGGTAAAGCT 
TCATTGGCAG AAACTGACAA GATCACACTT GAGGTTGCCA AACTATTAAA 
GGATGATTTC CTGCAACAGA ACAGCTATTC ACCATATGAC CGTTTCTGCC 
CATTCTACAA GACAGTAGGA ATGCTGAAAA ATATGATTGC TTTCTACGAT 

15 ATGTCTCGGC ATGCAGTTGA ATCTACTGCT CAGAGCGAGA ACAAGATCAC 
TTGGAATGTT ATTAGAGATT CTATGGGCAA TATTCTGTAT CAGCTTTCCT 
CCATGAAATT CAAGGATCCA GTCAAGGATG GAGAAGCGAA GATCAAGGCA 
GACTTTGAGC AGCTTCATGA AGACATTCAG CAAGCCTTCA GGAACCTGGA 
GGATTAAAGT GGTAGCTGCC AGTGGTTCTC TCGGTGCAGT TGTCACATTT 

2 0 GGCAAGCTCT GTAGGGTTGC CGAGTGGCAT CGGTGCTAGA CACCTGAGCA 

TTCCTTTGCC ACATAAAGAC TAAAGCAGGT GGAATTTCAG TTGTAAAAAG 
CTGGTTCCAT TGGTGCTAAG ATTATGTTGT GCCCTTTTCT GCTTCTCACA 
TTCCAACAGA GGAATTTACT TCCAGTTTTC TTCCATTTTC CTCCTCATTT 
TAAGTGTCGG TACAGAGGCA ATAATCTGAT AACTCTGTAC CGTC AC TTAC 
25 AAGCAGGGAG AATTTGTAAT TATTACAAAT CCCATTATCT CTGTGCACCA 
CAGCCTTGTA AATTCATTTG TCCCAGGACT CCCTCTTGTG TGTACGTGAG 
ATTGCCGTCT GTATGTATGT ACACACCGTA CTGCAGTATT TGAAGTCAGT 
CAGAAGGTGA ATTACACCAC TTACTCATTG TGTCACGTAG CAAGTGTGCA 
AACTGCCATC CATTGTCCTA TTTATTCACA TAACTAGTTT TCTTTGCATT 

3 0 -T-CCAGTGTTG CAAATTGTGT TTAGAAAATT ATGCCATCGA GACTGGTCGA 

ACCTCACATT GTAACTCAGT ATTTACACAC ACGTTTACTT GCTACAGAAA 
TGTAGAAAAA ATAATTGTTG TATATTGAAA GTACAAGTGA CAAAGTTGCA 
TTTAAAATGG TGAATGTATT TTATATTTCT TTTGTAGACA CAAGAGTTAA 
TGCATTTTGC TTAATGGAGA TGTATGTAAA CCTAAAATAG CAGTTTGTGC 

35 ACAAATTATG TATATGTGAA ATGGAGATGG TTTCTAATTT GCTGATTGAT 
TGCCAGTATT AATTTAAACA ACTGTAGTTG TGGGATGTAG TGGGAAGATT 
TTTTTTTTCC TATAAAATTG GTGGATGTAT GTGTCGGAGA TTTTGATTGT 
ATGTGTAAAA TAGTGATCCC AGTAAC TGTA AAGCTTTAGA ATACAGTTAC 
TGACTGTATA GTTGTACAGG TGTTGTTACT TTTAAGAATT TATTGACACA 

40 AAGGTGAAAG TCTATTATTG TATTGTAATG TTTAAAGCAT TTAAGGTTTA 
AAAATCC TAC TTC TGTGTAT AAATGTTACC ATTCCTCATA TAACATAACT 
GTGTAGAAAT ACAGTCAACT TCATGTTCAT TAGCATTTCA CTGTTGTCAC 
ATAAATTATG CCCGGAAT 



1050 

1100 

1150 

1200 

1250 

1300 

1350 

1400 

1450 

1500 

1550 

1600 

1650 

1700 

1750 

1800 

1850 

1900 

1950 

2000 

2050 

2100 

2150 

2200 

2250 

2300 

2350 

2400 

2450 

2500 

2550 

2600 

2650 

2700 

2750 

2800 

2850 

2900 

2950 

3000 

3050 

3100 

3118 
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2 ) INFORMATION FOR SEQ ID NO: 689 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1836 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



50 



55 



(ii) 



MOLECULE TYPE: Genomic DNA 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Plasmodium falciparum 

(B) STRAIN: 3D7 

(C) ACCESSION NUMBER: L08200 



60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 689 

ATGACAAAAG TTGCTGTTGA AAAAGAGGAA CCAGGAGTTG TTTATAAGGT 50 

gSggttca TTAGTTATTG CTGAGAATAT GAGTGGAACT CGTATGTACG 100 

SSSkSS agtaggatgg AATAAATTGG TTGGAGAAAT tattagatta 150 

PAACGGAATT ATGCATATAT ACAAGTTTAT GAAGATACTT CAGGTTTATC 2 00 

TGTAGGAGAC CCTGTTATAA AAACAGGAAA TGCTTTATCA GTCGAATTGG 250 

GTCCTGGTAT TTTAGATAAT ATTTATGATG GTATTCAAAG ACCATTAGAA 300 

a?Ktagcaa ATGTGTGTGG TGATGTATAT ATATATAAAG GTATTGATAT 350 

SacItc?™ Satcatgata AACAATGGCA ATTTTATGCT GATAAGAAAT 400 

TAAaStAAA TGATATTGTT ACTGGTGGAG ATATCTTTGG ATTTGTTGAT 450 

GAAAATAAAT TATTTAAAGA ACACAAAATT ATGGCTCCAC CTAATGCTAA .500 

ATPGAGGCTT ACATATATTG CTCCAGATGG ATCATATACT TTAAAAGATA 550 

5SSmotga ATTAGAATAT CAAGGAAAAA AATATACATA TGGTTTATCT 600 

CATTTATGGC CTGTTCGTGA TCCTAGACCT GTTTTAGAAA AGGTAACAGG 650 

prATACTTTA TTATTAACAG GGCAAAGAGT TTTAGATTCG TTATTTCCAA 700 

SStoaIgG AGGTACTTGT GCTATTCCTG GTGCATTTGG TTGTGGAAAA 750 

SS^^TTT CTCAGGCCTT ATCAAAATAT TCTAATAGTG AAGTTATTAT 800 

atatc?aSgI Saggtaatga AATGGCTGAA ATTTTATCCG 850 

Sttcctga ATTAACTACT AAAGTAGATA ATGAAGATGT AGGTATTATG 900 

CAAAGAACGT GTTTAGTTGC TAATACTTCT AACATGCCTG TCGCTGCAAG 950 

ACAAGCTAGT ATTTATACAG GTATTACTTT ATGTGAATAT TTCCGTGATA 1000 

5SS??A?AA TGCTACCATG ATGGCTGATA GTACAAGTAG ATGGGCAGAA 1050 

gcSSSS aaItttcagg ACGTTTAGCT GAAATGCCTG CAGATAGTGG 1100 

TTATCCAGCT TATTTAGGTG CTAGATTAGC TTCCTTTTAT GAACGTGCAG 1150 

gIaaagtcaa ATGTATTGGT TCTCCATCTC GTATAGGATC CATTACAATT 1200 

g?5gg?gctg TGTCTCCACC AGGTGGTGAT TTCTCTGACC CTGTAACTAC 1250 

a^aaccatg TCTATTGTTC AAGCATTTTG GGGGTTAGAT AAAAAACTAG 1300 

CTCAAAGAAA ACATTTCCCT TCTGTTAATT GGTCTACATC CTTTTCAAAG 1350 

ta?g1£ag1£ AATTAGAACA ATACTTTGAT AATTTTGATC AAGATTTCTT 1400 

ATCTTTAAGA CAAAAAATTA GTGATATTTT ACAACAAGAA AGTGACTTGA 1450 

ATGATATTGT TCAACTAGTA GGAAAGGATT CATTATCAGA AG AC C AAAAA 1500 

GTTGTTATGG AAGTAGCCAA AATTATTAGA GAAGATTTTC TTCAACAAAA 1550 

TgSSaGC GATTATGATT ATATGTGCCC ATTACAAAAA ACAGTTGGTA 1600 

tStc^AAT TATTTGCCAC TTTTATGCTC AATGCTTAAG AACATTACAA 1650 

gSJaTCACT CAAGAGAAAG AAAAATTGGT TGGGGATCTA TATATAATAC 1700 

ATTAAGACCA ACTATAAATA AAATTACACA TATGAAATTT GAAAACCCAA 1750 

aSa^tcaga TGAATATTTC AAAAAGTATT TTAAGGCACT TGAAGAAGAA 1800 

ATAACAGTAG GTTTAAGAAA CTTGATGGAA AAATGA J- 0 ^ 0 



2 ) INFORMATION FOR SEQ ID NO: 690 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3216 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: . . 

(A) ORGANISM: Saccharomyces cerevisiae 

(B) STRAIN: X2180-1A 

(C) ACCESSION NUMBER: J05409 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 690 
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CA 02307010 2000-05-19 



ATGGCTGGTG CAATTGAAAA CGC TCGTAAG GAAATAAAAA GAATCTCATT 50 

AGAAGACCAT GCTGAATCTG AATATGGTGC CATCTATTCT GTCTCTGGTC 100 

CGGTCGTCAT TGCTGAAAAT ATGATTGGTT GTGCCATGTA CGAATTGGTC 150 

AAGGTCGGTC ACGATAACCT GGTGGGTGAA GTCATTAGAA TTGACGGTGA 2 00 

5 CAAGGCCACC ATCCAAGTTT ACGAAGAAAC TGCAGGCCTT ACGGTCGGTG 250 

ACCCTGTTTT GAGAACAGGT AAGCCTCTGT CGGTAGAATT GGGTCCTGGT 300 

CTGATGGAAA CCATTTACGA TGGTATTCAA AGACCTTTGA AAGCCATTAA 350 

GGAAGAATCG CAATCGATTT ATATCCCAAG AGGTATTGAC ACTCCAGCTT 400 

TGGATAGGAC TATCAAGTGG CAATTTACTC CGGGAAAGTT TCAAGTCGGC 450 

10 GATCATATTT C C GGTGGTG A TATTTACGGT TCCGTTTTTG AGAATTCGCT 500 
AATTTCAAGC CATAAGATTC TTTTGCCACC AAGATCAAGA GGTACAATCA 550 
CTTGGATTGC TCCAGCTGGT GAGTACACTT TGGATGAGAA GATTTTGGAA 600 
GTTGAATTTG ATGGCAAGAA GTCTGATTTC ACTCTTTACC ATACTTGGCC 650 
TGTTCGTGTT CCAAGACCAG TTACTGAAAA GTTATCTGCT GACTATCCTT 7 00 

15 TGTTAACAGG TCAAAGAGTT TTGGATGCTT TGTTTCCTTG TGTTCAAGGT 750 
GGTACGACAT GTATTCCAGG TGC TTTTGGT TGTGGTAAGA CCGTTATCTC 800 
TCAATCTTTG TCCAAGTACT CCAATTCTGA CGCCATTATC TATGTCGGGT 850 
GCTTTGCCAA GGGTACCAAT GTTTTAATGG CGGATGGGTC TATTGAATGT 900 
ATTGAAAACA TTGAGGTTGG TAATAAGGTC ATGGGTAAAG ATGGCAGACC 950 

20 TCGTGAGGTA ATTAAATTGC C C AG AGGAAG AGAAACTATG TACAGCGTCG 1000 

TGCAGAAAAG TCAGCACAGA GCCCACAAAA GTGACTCAAG TCGTGAAGTG 1050 

CCAGAATTAC TCAAGTTTAC GTGTAATGCG ACCCATGAGT TGGTTGTTAG 1100 

AACACCTCGT AGTGTCCGCC GTTTGTCTCG TACCATTAAG GGTGTCGAAT 1150 

ATTTTGAAGT TATTAC TTTT GAGATGGGCC AAAAGAAAGC CCCCGACGGT 12 00 

25 AGAATTGTTG AGCTTGTCAA GGAAGTTTCA AAGAGCTACC CAATATCTGA 1250 

GGGGCCTGAG AGAGCCAACG AATTAGTAGA ATCCTATAGA AAGGCTTCAA 13 00 

ATAAAGCTTA TTTTGAGTGG ACTATTGAGG CCAGAGATCT TTCTCTGTTG 1350 

GGTTCCCATG TTC GTAAAGC TACC TACC AG ACTTACGCTC CAATTCTTTA 1400 

TGAGAATGAC CACTTTTTCG ACTACATGCA AAAAAGTAAG TTTCATCTCA 1450 

3 0 CCATTGAAGG TCCAAAAGTA CTTGCTTATT TACTTGGTTT ATGGATTGGT 15 00 

GATGGATTGT CTGACAGGGC AACTTTTTCG GTTGATTCCA GAGATACTTC 1550 

TTTGATGGAA CGTGTTACTG AATATGC TG A AAAGTTGAAT TTGTGCGCCG 1600 

AGTATAAGGA CAGAAAAGAA CCACAAGTTG CCAAAACTGT TAATTTGTAC 1650 

TCTAAAGTTG TCAGAGGTAA TGGTATTCGC AATAATCTTA ATACTGAGAA 17 00 

35 TCCATTATGG GACGCTATTG TTGGCTTAGG ATTCTTGAAG GACGGTGTCA 1750 

AAAATATTCC TTCTTTCTTG TC TACGG AC A ATATCGGTAC TCGTGAAACA 1800 

TTTCTTGCTG GTCTAATTGA TTCTGATGGC TATGTTACTG ATGAGCATGG 1850 

TATTAAAGCA ACAATAAAGA CAATTCATAC TTCTGTCAGA GATGGTTTGG 1900 

TTTCCCTTGC TCGTTCTTTA GGCTTAGTAG TCTCGGTTAA CGCAGAACCT 1950 

40 GCTAAGGTTG ACATGAATGG CACCAAACAT AAAATTAGTT ATGCTATTTA 2 000 

TATGTC TGGT GGAGATGTTT TGCTTAACGT TCTTTCGAAG TGTGCCGGCT 2 05 0 

CTAAAAAATT CAGGCCTGCT CCCGCCGCTG CTTTTGCACG TGAGTGCCGC 2100 

GGATTTTATT TCGAGTTACA AGAATTGAAG GAAGACGATT ATTATGGGAT 2150 

TACTTTATCT GATGATTCTG ATCATCAGTT TTTGCTTGCC AACCAGGTTG 2200 

45 TCGTCCATAA TTGCGGAGAA AGAGGTAATG AAATGGCAGA AGTCTTGATG 2250 

GAATTCCCAG AGTTATATAC TGAAATGAGC GGTACTAAAG AACCAATTAT 23 00 

GAAGCGTACT ACTTTGGTCG CTAATACATC TAACATGCCG GTTGCAGCCA 23 50 

GAGAAGCTTC TATTTACACT GGTATC AC TC TTGCAGAATA CTTCAGAGAT 2400 

CAAGGTAAAA ATGTTTCTAT GATTGCAGAC TCTTCTTCAA G ATGGGC TG A 2450 

50 AGC TTTG AG A GAAATTTCTG GTCGTTTGGG TGAGATGCCT GCTGATCAAG 2 5 00 

GTTTCCCAGC TTATTTGGGT GCTAAGTTGG CCTCCTTTTA CGAAAGAGCC 2 550 

GGTAAAGCTG TTGCTTTAGG TTCCCCAGAT CGTACTGGTT CCGTTTCCAT 2 600 

CGTTGCTGCC GTTTCGCCAG CCGGTGGTGA TTTCTCAGAT CCTGTTACTA 2 650 

CTGCTACATT GGGTATCACT CAAGTCTTTT GGGGTTTAGA CAAGAAATTG 27 00 

55 GCTCAAAGAA AGCATTTCCC ATCTATCAAC ACATCTGTTT CTTACTCCAA 27 50 

ATACACTAAT GTCTTGAACA AGTTTTATGA TTCCAATTAC CCTGAATTTC 2 800 

CTGTTTTAAG AGATCGTATG AAGGAAATTC TATCAAACGC TGAAGAATTA 2 85 0 

GAACAAGTTG TTCAATTAGT TGGTAAATCG GCCTTGTCTG ATAGTGATAA 2 900 

G ATTAC TTTG GATGTTGCCA CTTTAATCAA GGAAGATTTC TTGCAACAAA 2950 

60 ATGGTTAC TC CACTTATGAT GCTTTCTGTC CAATTTGGAA GACATTTGAT 3 000 
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ATGATGAGAG CCTTCATCTC GTATCATGAC 
TAATGGTGCC AACTGGTCAA AACTAGCTGA 
ATGCCGTTTC TTCATCTAAA TTTTTTGAAC 
GTCCATGGCG AATTCGAAAA ATTGTTGAGC 
5 TGAATCTACC GATTAA 



GAAGCTCAAA AAGCTGTTGC 3 0 50 

CTCTACTGGT GACGTTAAGC 3100 
CAAGCAGGGG TGAAAAGGAA 3150 
ACTATGCAAG AAAGATTTGC 32 00 

3216 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



2 ) INFORMATION FOR SEQ ID NO : 691 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1860 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Schizosaccharomyces pombe 

(B) STRAIN: 972 h- 

(C) ACCESSION NUMBER: S47814 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 691 



ATGGCGGGAG GAATTGAACT GGCCAAGAAG 
TTACGACGAG C ATGAAAAC C GATATGGATC 
CTGTCGTTGT TGCAGCCAAT ATGCTTGGAT 
CGCGTTGGTC ATGAAGAACT AGTTGGTGAA 
TAAATGTACT ATTCAAGTAT ACGAAGAGAC 
ATCCTGTCCA ^ACGCACTGGA AAGCCATTAT 
TTAGCTGAGA CTATTTATGA TGGTATCCAA 
CGACAAATCT CAAAGTATTT ATATTCCTAG 
TTAATCGTGA GCATAAGTGG GATTTCACAC 
GGCGATCATG TATCCGGTGG TGATGTTTTC 
TCTTTTCAAT GATCATAAAA TTATGTTACC 
TCACATATAT TGCTGAAGCT GGATCATACC 
GAAGTCGAGT TTAATGGCAA GAAACATTCT 
GCCTGTCCGT GCTGCTCGTC CAGTTGCGGA 
CTTTATTGAC TGGTCAACGT GTTTTGGATG 
GGTGGC AC T A CTGCTATCCC CGGTGCCTTT 
TTCACAATCT CTTTCTAAGT ACTCTAATTC 
GTTGTGGTGA ACGTGGAAAC G AAATGGC AG 
GAACTAACAA TTGATATTAA TGGTAAACCA 
TACATTGGTA GCCAACACTT CTAACATGCC 
CCATTTATAC C GGTATT AC A CTTGCTGAAT 
AACGTTTCAA TGATGGCTGA TTCTACATCT 
TGAAATTTCT GGTCGTTTGG CTGAGATGCC 
CTTATTTGGG TGCCAAATTG GCTTCTTTTT 
CGTTGCTTGG GAAGTCCTGA CCGTGAAGGA 
TGTTTCTCCT CCGGGTGGTG ATTTTTC TG A 
TGGGAATTGT TCAAGTCTTC TGGGGTTTGG 
AAAC AC TTTC CCTCAATCAA CACCTCTCTT 
TGCTTTGCAA CCTTGGTATG AGGAAAGAGT 
GTGATCAAAT CAAACAGATC ATTCAACAAG 
ATTCAGTTGG TTGGTAAGTC GGCTCTTTCT 
GGACATAGCC GGTATTATTA AG AATG AC TT 
CTGATTACGA TCGCTGTTGC CCTCTTTACA 
AACATGATTG CTTACTACAC AAAGGC T AAA 
CGTTCCTTGG TC AAAGATT A AAGAAAGTAC 



GCTATCAGGA 
TATTTTCAGC 
GTTCGATGTA 
GTAATTCGTA 
GTCCGGTCTC 
CTGTTGAATT 
CGTCCGTTAA 
AGGTATTAAT 
CAAATAAGGA 
GGTTCTGTAT 
CCCTAGAGCC 
ATGTTGATGA 
TTTAGTATGT 
CAACTTAACT 
CGTTATACCC 
GGTTGTGGTA 
TGATTTGATT 
AAGTGTTAAT 
GAGCCCATTA 
TGTCGCTGCT 
ATTATCGTGA 
CGTTGGGCTG 
TGCCGATTCT 
ACGAACGTGC 
ACAGTTTCAA 
TCCTGTTACT 
ACAAGAAATT 
TCCTATTCTA 
TCCAGGCTTT 
AAGATTCCAT 
GAAACGGATA 
CTTACAACAA 
AGACTTATCA 
AGTGCCGTTG 
TTCAGATATC 



GCCTCAAAAA 50 

GTTTCTGGTC 100 

CGAACTCGTT 150 

TCCATCAAGA 200 

ACTGTTGGTG 250 

AGGTC CTGGT 300 

AGCAAATTTT 3 50 

ACAGAATCAC 400 

TTTACGCATT 450 

TTGAAAACTC 500 

CGTGGTACCG 550 

AAAACTTCTT 600 

TGCATACTTG 650 

GCTAATCAAC 700 

CTGTGTTCAA 7 50 

AAACAGTTAT 800 

GTTTACGTCG 850 

GGATTTCCCA 900 

TGAAGCGTAC 950 

CGTGAAGCTT 1000 

TCAAGGTAAG 1050 

AAGCTTTGCG 1100 

GGTTATCCCG 1150 

TGGTCGTGCT 1200 

TTGTTGG AGC 1250 

AGTGCAACCT 1300 

GGCCCAACGT 1350 

AATACATCAA 1400 

AATACTCTTC 1450 

GTTGGAAATT 1500 

AAGTT AC TTT 1550 

AAC GG TT ATT 1600 

TATGATGCGA 1650 

AAACTGGTAG 1700 

TTTTATGAGT 1750 
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TAACCTCGAT GAAATTCGAA AACCCTAATG AAGGCGAGAA GGAAATAGTC 1800 
GAACACTATG AAACTCTGCA CAAGAAGATT GAGGACAAGT TTCACACTCT 1850 
G AC TGAGTAA I860 



2 ) INFORMATION FOR SEQ ID NO: 692 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 1833 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma congolense 

(B) STRAIN: IL3000 

20 (C) ACCESSION NUMBER: Z25814 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 692 

ATGACGAGCG ATAAAAACCC TTACAAAACA GAGCAGCGCA TGGGGGCCGT 50 

25 GAAGGCCGTC TCCGGGCCAG TTGTCATTGC TGAAAACATG GGCGGTAGCG 100 

CTATGTATGA GCTTGTGCAG GTAGGTTCCT TCCGGTTAGT GGGCGAGATC 150 

, ATTCGTCTAG AGGGCGATAC CGCCACTATT CAGGTCTATG AGGAAACAGG 200 

TGGCCTCACT GTCGGAGACC CGGTGTACTG TACGGGTAAG CCTCTTTCGC 250 

TTGAGCTTGG ACCTGGAATC ATGTCTGAAA TATTTGACGG TATCCAGCGG 3 00 

3 0 CCTCTTGACA CCATCTACCG CATGGTGGAA AAC GTGTTT A TCCCCAGGGG 3 50 

CGTTCAGGTG AAGTCACTCA ATGACCAGAA ACAGTGGGAC TTTAAGCCAT 400 

GCCTGAAGGT TGGAGATCTT GTGTCTGGTG GTGATATCAT TGGCTCAGTG 450 

GTGGAGAACT CTCTCATGTA CAATCACAGC ATTATGATTC CGCCCAATGT 500 

GCGGGGCCGT GTTACTTCCA TTGTTCCTTC AGGAAATTAC ACCCTCCAAG 550 

3 5 ATGACATTAT TGAATTGGAA TATAATGGGA CAGTGAAATC ACTAAAACTT 600 

ATGCATCGCT GGCCAGTACG GACCCCGCGT CCTGTGGCGT CAAAAGAATC 650 

CGGCAATCAT CCGCTTCTCA CCGGACAGCG TGTGCTCGAT GCTCTCTTTC 7 00 

CATCCGTCCA GGGTGGAACA TGCGCCATCC CTGGC GCGTT TGGATGCGGA 750 

AAGACGGTTA TCAGTCAGGC TCTTTCGAAG TTCTCCAACA GCGACGCTGT 800 

40 TATC TATGTC GGCTGCGGCG AGCGTGGGAA TGAGATGGCA GAGGTGCTCA 850 

TGGACTTCCC CACACTCACC ACCGTTATTG ATGGTCGTGA GGAGTCCATC 900 

ATGAAGCGTA CCTGCCTGGT GGCAAACACC TCAAATATGC CTGTCGCTGC 950 

TCGTGAGGCG TCTATTTACA CTGGC ATC AC TTTAGCTGAG TATTATCGTG 1000 

ATATGGGCAA GCACATTGCT ATGATGGCCG ACTCTACCTC TCGATGGGCT 1050 

45 GAGGCTCTCC GTGAGATCTC TGGGCGTCTC GCTGAAATGC CCGCTGATGG 1100 

TGGTTACCCT GCGTACCTCA GTGCGCGTCT TGCTTCCTTC TACGAGCGTG 1150 

CGGGGCGCGT GACATGCATC GGTGGGCCAA AACGCGAGGG CTCAGTAACC 1200 

ATCGTTGGTG CCGTTTCTCC TCCTGGAGGT GACTTTTCTG ACCCAGTGAC 1250 

GTCCGCTACG CTTGGTATTG TGCAAGTCTT TTGGGGTCTT GAGAAGCGTC 1300 

50 TTGCGCAACG TAAACACTTT CCTTCTGTTA ATTGGCTCAT TTCCTATTCA 1350 
AAAT AC CTTA ATGC TTTGG A GCCCTTCTTC AACACGCTTG ACCCTGACTA ' 1400 

CATGCGCCTG CGGTCAGTTG CTGCGGAGAT CCTTCAGCGT GAGGAAGAGT 1450 

TGCAAGAAAT TGTTCAACTT GTCGGTAAGG ACTCACTTTC GGAGTCTGAC 1500 

AAAATTATTC TAGAAACGGC TAAGGTTATT CGTGAAGAGT TTCTCCAGCA 1550 

55 GAATGCC TTT ACGCCGTACG ACAAGTATTG CCCGCCGTAC AAGAC CTGCT 1600 

GGATGCTACG TAACATTGTC GCGTTCTACG AGGAGAGCCA GCGCGTTGTA 1650 

GCTGAGTCCG CTGGGGAACT TAAGATTACG TGGAACTACA TTCGTGAAAT 17 00 

GATTCCTCAT ATTTACACGG GTTTAACTGA GATGAAGTTC CGTGATCCTC 17 50 

AGGAGGGTGA GGAGGCCAAC GTAGAATTCT ACAGAAAACA AAATGAGGAA 1800 

60 ATTGTCAGCG CATTCGCCTC GCTGCTGCAA TAA 183 3 
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2) INFORMATION FOR SEQ ID NO: 693 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1758 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Thermus thermophilus 

(B) STRAIN: HB8 

(C) ACCESSION NUMBER: D63799 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 693 

20 ATGATCCAAG GGGTGATCCA GAAGATCGCG GGCCCGGCGG TGATCGCCAA 50 

GGGCATGCTC GGGGCCCGCA TGTACGACAT CTGCAAGGTG GGCGAAGAGG 100 

GCCTCGTGGG CGAGATCATC CGCCTGGACG GGGACACGGC CTTCGTCCAG 150 

GTCTACGAGG ACACCTCGGG CCTAAAGGTG GGGGAGCCCG TGGTCTCCAC 2 00 

25 GGG?CTTCCC JtGGCGGTGG AGCTCGGCCC CGGGATGCTG AACGGCATCT 250 

ACGACGGCAT CCAGCGCCCC C TGGAGCGC A TCCGGGAGAA GACGGGGATC 3 00 

?A?ATCACCC GGGGCGTGGT GGTCCACGCC CTGGACCGGG AGAAGAAGTG 350 

GGCCTGGACG CCCATGGTCA AGCCCGGGGA CGAGGTGCGG GGGGGTATGG 400 

TCCTGGGCAC GGTGCCCGAG TTCGGCTTCA CCCACAAGAT CCTGGTACCC 450 

30 CCGGACGTGC GGGGCCGGGT CAAGGAGGTG AAGCCCGCCG GGGAGTACAC 500 

CGTGGAGGAG CCGGTGGTGG ~TC CTCG AGG A CGGCACCGAG— CTCAAGATGT 55 0 

" ACCACACCTG GCCCGTTCGC CGGGCGAGGC CCGTGCAAAG GAAGCTTGAC 600 

CCCAACACCC CCTTCCTCAC GGGGATGCGC ATCCTGGACG TCCTCTTCCC 650 

CGTGGCCATG GGGGGCACCG CCGCCATCCC TGGGCCCTTC GGCAGCGGCA 700 

3 5 AGACCGTGAC CCAGCAGTCC CTGGCCAAGT GGTCCAACGC CGACGTGGTG 750 

GTCTACGTGG GCTGCGGGGA GCGGGGGAAC GAGATGACCG ACGTGCTCGT 800 

GGAGTTCCCC GAGCTCACCG ACCCCAAGAC GGGTGGGCCC TTGATGCACC 850 

GCACCGTCCT CATCGCCAAC ACCTCCAACA TGCCCGTGGC CGCCCGCGAG 900 

GCCAGCATCT ACGTGGGCGT GACCATCGCC GAGTACTTCC GCGACCAGGG 95 0 

40 CTTCTC CGTG GCCCTCATGG CCGACTCCAC GAGCCGCTGG GCCGAGGCTT 1000 

TGCGCGAGAT CTCTAGCCGC CTCGAGGAGA TGCCCGCCGA GGAGGGCTAC 1050 

CCGCCCTACC TCGCCGCCAG GCTCGCCGCC TTCTACGAGC GGGCGGGCAA 1100 

GGTCATCACC CTGGGCGGCG AGGAGGGGGC GGTGACCATC GTGGGGGCCG 1150 

TCTCCCCGCC GGGCGGCGAC ATGTCCGAGC CCGTGACCCA GTCCACCTTG 1200 

45 AGGATCGTGG GGGCCTTCTG GCGGCTTGAC GCCTCCCTGG CCTTCCGCCG 1250 

CCACTTCCCC GCCATCAACT GGAACGGCTC CTACAGCCTC TTCACCTCCG 1300 

CCCTTGACCC CTGGTACCGG GAGAACGTGG CCGAGGACTA CCCCGAGCTC 1350 

CGCGACGCCA TCTCCGAGCT TTTGCAGCGG GAGGCGGGCC TCCAGGAGAT 1400 

CGTCCAGCTC GTGGGGCCGG ACGCCCTCCA GGACGC CGAG CGCCTCGTCA 1450 

50 TTGAGGTGGG CCGGATCATC CGCGAGGACT TCCTGCAGCA GAACGCCTAC 1500 

CACGAGGTGG ACGCCTACTG CTCCATGAAG AAGGC C TACG GGATCATGAA 1550 

GATGATCCTC GCCTTCTACA AGGAGGCGGA GGC GGC CATC AAGCGGGGGG 1600 

TTTCCATAGA CGAGATCCTG CAGCTCCCCG TTCTGGAGCG CATCGGCCGC 1650 

GCCCGCTACG TGAGCGAGGA GGAGTTCCCC GCCTACTTTG AGGAGGCCAT 17 00 

55 GAAGGAGATC CAGGGGGCCT TCAAGGCTGG CCTAAAGGGG GAGAGATGGA 1750 
CCTTCTGA 



60 2 ) INFORMATION FOR SEQ ID NO: 694 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 694 

CGGCGCIATC YTSGTTGTTG C 21 



15 



25 



30 



2 ) INFORMATION FOR SEQ ID NO: 695 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 bases 
20 (B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : S ing 1 e 

( D ) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 695 
GTTTCACGTG ATGACGTACA 2 0 

2 ) INFORMATION FOR SEQ ID NO: 696 



(i) SEQUENCE CHARACTERISTICS: 
3 5 (A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

( D ) TOPOLOGY : Linear 

40 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 696 
ATIGGICAYR TIGAYCAYGG IAARAC 2 6 

45 

2 ) INFORMATION FOR SEQ ID NO: 697 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 697 

60 CCIACIGTIC KICCRCCYTC RCG 2 3 

601 
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2 ) INFORMATION FOR SEQ ID NO : 698 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1185 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Escherichia coll 

(C) ACCESSION NUMBER: extracted from J01690 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 698 

20 GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTTA ACGTTGGTAC 50 

TATCGGCCAC GTTGACCACG GT AAAAC T AC TCTGACCGCT GCAATCACCA 100 

CCGTACTGGC TAAAACCTAC GGCGGTGCTG CTCGTGCATT CGACCAGATC 150 

GATAACGCGC CGGAAGAAAA AGCTCGTGGT ATCACCATCA ACACTTCTCA 200 

CGTTGAATAC GACACCCCGA CCCGTCACTA CGCACACGTA GACTGCCCGG 250 

25 GGCACGCCGA C T ATGTT AAA AACATGATCA CCGGTGCTGC TCAGATGGAC 3 00 

GGCGCGATCC TGGTAGTTGC TGCGAC TG AC GGCCCGATGC CGCAGACTCG 350 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 400 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAAC TGGTT 450 

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGACTTCC CGGGCGACGA 5 00 

^0 CACTCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 550 

AGTGGGAAGC GAAAATCCTG GAACTGGCTG GCTTCCTGGA TTCTTATATT 600 

C C GGAAC C AG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 650 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC 7 00 

GCGGTATCAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAG 750 

35 ACTCAGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 800 

CGAAGGCCGT GC TGGTG AG A ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 850 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAG 9 00 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG 950 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 1000 

4 0 GTACTACTGA CGTGACTGGT ACCATCGAAC TGCCGGAAGG CGTAGAGATG 1050 

^ GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 1100 

CGCGATGGAC GACGGTCTGC GTTTCGCAAT CCGTGAAGGC GGCCGTACCG 1150 

TTGGCGCGGG CGTTGTTGCT AAAGTTC TGG GCTAA 1185 

45 

2 ) INFORMATION FOR SEQ ID NO: 699 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 699 
GTIACIGGYT CYTYRARRTT ICCICC 26 

60 
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10 



20 



50 



60 



2 ) INFORMATION FOR SEQ ID NO : 7 00 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 700 
15 TIRTIGAYGT CGARTTCCCT CARG 24 



2 ) INFORMATION FOR SEQ ID NO: 701 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
2 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 01 

30 

GTGTTCACGA TCATCGATGC - G . 21 



3 5 2 ) INFORMATION FOR SEQ ID NO: 702 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 702 

CTCTCGATAT CCGCGAAGCG 20 



2 ) INFORMATION FOR SEQ ID NO: 703 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 03 

20 

TATGG AAATT CGAAAC ATCT 



2 ) INFORMATION FOR SEQ ID NO: 7 04 

(i) SEQUENCE CHARACTERISTICS: 
IQ (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 04 

20 

AGTGCTCCAA TTAATGTTGG 

20 



2 ) INFORMATION FOR SEQ ID NO: 7 05 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 05 
35 GTACAGTTCC AATACCTGAA 



2 ) INFORMATION FOR SEQ ID NO: 706 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 706 
TGAAATCTTC ACATCCAACA 



5 5 2 ) INFORMATION FOR SEQ ID NO: 707 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 

604 
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10 



20 



25 



40 



(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 707 
TWACCATTTC AGTACCTTCT GGTAA 2 5 

2 ) INFORMATION FOR SEQ ID NO: 708 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
15 (B) TYPE: Nucleic acid 

<C) STRANDEDNESS : Single 
(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 708 
TCRTCCATIC CIARIATIGC I AT I AT 2 6 

2 ) INFORMATION FOR SEQ ID NO : 709 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 1656 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Borrelia burgdorferi 

(C) ACCESSION NUMBER: extracted from AE001122 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 09 



ATGAATGAAG TTTTATTTGT AAAGACTGCT GGTAGGAATT TAAAAGCAGA 50 

AGTAATTCGT ATTAGGGGCA ATGAAGTTGA TGCACAGGTT TTTGAATTGA 100 

45 CAAAAGGGAT ATCTGTTGGA GACCTAGTTG AATTTACAGA CAAACTTTTA 150 

ACAGTTGAAC TCGGACCAGG GCTTTTAACT CAAGTATATG ATGGGCTTCA 2 00 

AAATCCTTTG CCTGAATTGG CTATTCAATG TGGATTTTTT TTAGAAAGGG 250 

GAGTATATTT AAGGCCCTTG AATAAAGATA AAAAGTGGAA TTTTAAAAAA 3 00 

ACCTCCAAAG TTGGAGATAT CGTTATTGCA GGAGATTTTT TAGGTTTTGT 3 50 

50 AATTGAGGGA ACTGTTCACC ATCAAATAAT GATTCCATTT TATAAAAGGG 400 

ATTCTTATAA AATTGTGGAG ATTGTAAGTG ATGGCGACTA TTCGATTGAT 450 

GAGCAAATTG CTGTAATTGA AGATGATTCT GGTATGAGGC ATAATATTAC 500 

AATGTCTTTT CATTGGCCTG TTAAAGTTCC TATTACTAAT TATAAGGAAC 550 

GCCTTATTCC TAGTGAACCT ATGTTGACTC AAACTAGAAT TATAGATACA 600 

55 TTTTTC C C AG TTGCCAAAGG TGGAACTTTT TGCATTCCGG GTCCTTTTGG 650 

AGCAGGAAAA ACGGTTCTTC AGCAGGTTAC AAGTCGAAAT GCTGATGTTG 7 00 

ATGTAGTGAT TATTGCAGCT TGTGGTGAGC GAGCAGGAGA AGTGGTAGAA 750 

ACTCTTAAAG AATTTCCCGA ATTAATGGAT CCAAAAACCG GCAAATCTTT 800 

AATGGACAGG AC TTGTATT A TTTGTAATAC ATCTTCAATG CCAGTTGCAG 85 0 

60 C T AG AG AAGC TTCTGTTTAT ACTGCTATTA CTATTGGTGA GTATTACAGG 900 
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CAAATGGGCC 
TCAAGCAATG 
AGGCTTTTCC 
GCAGGTATTG 
5 TGGCTCTGTA 
C AAC TTT AAA 
GATGCTAGGA 
TAAAGGCGTT 
TGAAAGGTAA 

10 ATAAGTAACG 
GTGCTATTTG 
CAGAGCGTCA 
AACTTTGAGT 
AAGGCAAAAT 

15 TTAATAAATT 
ATTTAG 



TTGATATTCT 
AGAGAAATGT 
GGCATATCTT 
TAGTTCTTAA 
AGTCCTGCTG 
AGTTGTAGGA 
AATTTCCAGC 
ATTGATCAAA 
TGAAATTAAT 
ATGATTTTTT 
CAGCAAAATT 
AAATTATATG 
TTTCTGATAA 
CTTTTAGACA 
GGAGCATGCT 



TCTTTTGGCA 
CTGGACGCCT 
GAGTCTGTTA 
T AATGG GG AT 
GTGGTAATTT 
GCATTTCACG 
TATTAGTCCT 
AAAAGACTGA 
CAAATGATGA 
AATTTATTTA 
CATTTGATTC 
TTTGATATAG 
ACTTCAAGCA 
TGAATCTTTC 
TTGGGTGAAT 



GATTCAACTT 
TGAGGAAATT 
TTGCTTCCTT 
ATTGGATCTG 
TGAAGAGCCA 
GGCTTACAAG 
CTTGAATCTT 
ATATGCAAGA 
AAGTTGTTGG 
AAATCCGAGC 
TATTGATGCT 
TTTATAACAT 
AGAGATTTTA 
TTCTTTTAAG 
TGATAAATTT 



CAAGATGGGC 9 50 

CCTGGCGAGG 1000 

TTATGAAAGG 1050 

TAACAGTTGG 1100 

GTTACTCAAG 1150 

AGAAAGGTCT 12 00 

GGAGTAAATA 12 50 

TCTTTTTTGG 13 00 

AGAAGAAGGC 13 50 

TAC TTGATTC 1400 

GCTGTTAGTT 1450 

TCTTAAAACT 1500 

TAAATGAGTT 1550 

GATCATAAGT 1600 

TAAAAAGGTA 1650 
1656 



2 0 2 ) INFORMATION FOR SEQ ID NO : 710 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1818 bases 

(B) TYPE: Nucleic acid 

2 5 (C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 



3 0 (vi) ORIGINAL SOURCE: 

(A) — ORGANISM: Treponema pallidum 

(B) STRAIN: Nichols 

(C) ACCESSION NUMBER: extracted from AE000520 
3 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 710 

GTGATCAAAG ACGATGTGGT TACAGGCCGT GTAGTGAGGG TGTCTGGTCC 50 

CATTGTGTAT GCCGAGGGCC TCTCTGCGTG CAGCGTATAC GATGTTGTCG 100 

ACGTAGGGGA AGCATCGCTC ATCGGAGAAA TTATCCGGTT GGATGAGAGC 15 0 

40 AAGGCGGTCG TGCAAGTATA CGAGGATGAC ACAGGTATGC GAGTCGGGGA 2 00 

GAAGGTGACA AGCTTGCGTC GACCACTCTC AGTCCGCTTA GGGCCTGGAT 250 

TAATCGGCAC CATTTATGAC GGTATTCAGC GCCCACTTGA GCGCCTCTTC 3 00 

CAAGAAGACG GCGCCTTCTT GCGTCCTGGT GCGCGTTCAC AAC CGCTTG A 350 

TGGCTCCGTA CGCTGGGATT TTCGTCCTCA TTGTAACGAG CGCGGTGAGG 400 

45 CCCTGTGCGC GGGGATTCCG ATTGCACCTG GGTCAGTGTT AGGG AC CGTG 45 0 

C AGGAGAC TC CTTC TGTTGT GCACACTATC ATGGTTCCTC CTGACATCCG 500 

GGGGAGCGTG CTATCTTCGT TCAAGGGCGC AGGTGC TTAC ACAATAGATG 550 

AAGAAATTGG ACGCACTGAT CTTGGTGAGC CGCTTTTTCT ATCCCAGTAC 600 

TGGCCAGTGC GTCGTGCGCG TCCTTTCAGC AAAAAACTTG CAGTGTGTGA 65 0 

50 GCCACTAGTT ACTGGACAGC GGGCGATTGA TGTTTTCTTC CCCCTATCAA 7 00 

AGGGAGGAAC GGCGGCTATT CCAGGGGGAT TTGGAACTGG GAAGACAATG 750 

ACGCAGCATG CCGTTGCCAA GTGGTGTGAT GCAGATATTA TCGTGTACAT 800 

CGGCTGCGGA GAGCGGGGCA ACGAGATGAC AGACGTGCTC TCTGAATTTC 850 

CCAAACTCAT CGATCCGCGC ACAGGACGCT CTCTTATGGA GCGG AC G ATT 9 00 

55 TTGATCGCAA ATACGTCCAA TATGCCTGTG TCCGCACGCG AGGTGTCGCT 950 

GTATTCAGGG ATTACCCTTG CGGAATACTA CCGTGATATG GGTATGCATG 100 0 

TGGCCATCAT GGCTGATTCT ACCAGCCGCT GGGCGGAGGC GCTGCGTGAA 105 0 

TTGTCTGGGC GCATGGAAGA AATGCCTGCG GAGGAGGGAT TCCCTGCGTA 1100 

CCTTCCGACG CGTCTTGCAG AATTTTATGA GCGCGCAGGA CGCGTGGAAA 1150 

60 CCTGTGTGGC GCGCGAGGGC TCTGTGAGCA TCATTGGTGC TGTTTCTCCC 1200 

606 
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CTGGGTGGAG ATTTCTCTGA GCCGGTGACG CAGCACACAA AGCGCTTCAT 12 50 

CCGTTGCTTT TGGGCCTTGG ATCGTGAACT TGCACACGCG CGTCATTACC 13 00 

CTGCCATTGG GTGGATAGAT TCATACTCTG AATATGCGCA GGAAGTAAGT 13 5 0 

GCATGGTGGA GTAAGTATGA CCCGCGCGCA GGCGCGTTGC GCGCCGCAGC 14 00 

5 CTTGGATTTG CTGAGAAAGG AACAGCGGTT ACAGCAAATT GTCAGGCTTG 14 50 

TCGGTCCTGA TGCGCTGCCT GGAGAAGATC GTCTGGTGCT AATGGTGTGT 15 00 

GAAATGATCA AAGGTGGCTT TCTGCAGCAG AACGCTTTTG ATCCGACGGA 1550 

TGTGTTCTCC TGTCCCGAAA AGCAGGTGCA GATCTTGCGT AC C ATAGTGG 1600 

ATTTTCACGA ACGTGCCGTG GTGCTGCTGC GTGCAGGTAT TTCGCTTTCT 1650 

10 GCGCTGTCCC AGCTTTCGTG CCGGGAGCTC ATCGTACGTA TGAAAACTAC 17 00 

GTACGGGAAT GAGGATGTAC ACAAGATGCA GAAAGTGTAC GACACGATGT 1750 

GCACTGAGTT TGACCAACTG AGTGTGTGTG CTGCCGCGCG CACACAAGGG 1800 
GGGGAGAAAG TCGAATGA ' 1818 



15 



30 



2 ) INFORMATION FOR SEQ ID NO: 711 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1779 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia trachomatis 

(B) STRAIN: MoPn 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 711 



CAGGGCTATG TCGTAGAAGC TTACGGAAAT TTATTGCGGG TGCATTTTGA 50 

TGGGCATGTG CGTCAAGGAG AAGTGGC CTA TGTCAGCGTG GATGATACTT 100 

3 5 GGTTGAAAGC GGAAATTATA GAAGTTGTGG GAGATGAGGT TAAAGTCCAA 150 

GTTTTTGAGG AAACTCAAGG AATTTCTCGA GGCGCTTTGG TAACTTTTTC 2 00 

CGGGCATTTA TTAGAAGCGG AACTTGGGCC CGGTCTATTG CAAGGTATTT 2 50 

TTGACGGACT TCAGAATCGC TTAGAGGTAT TGGCAGATAC AAGCTTGTTC 3 00 

TTGAAAAGAG GGGAGTATGT TAATGCCATT TGTCGGGAAA CTGTATGGGC 3 50 

40 TTATACGCAA AAGGCTTCTG TCGGGGATGT TCTATCTCGG GGAGATGTGC 400 

TTGGTACAGT AAAGGAAGGG CGGTTTGATC ATAAAATCAT GGTTCCTTTC 450 

TCTTGTTTTG AGGAAGTGAC TATCACTTGG GTCATTTCTT CAGGAGATTA 500 

C AC TGTTGAT ACCGTTATTG CTAAAGGACG TACTGCTTCA GGAGCCGAGC 550 

TTGAATTTAC AATGGTTCAG AAATGGCCCA TTAAACAGGC TTTTTTAGAA 600 

45 GGGGAAAAGG TACCGTCTCA TGAAATTATG GATGTTGGGT TACGAGTATT 650 

AGATAC TC AG ATCCCCGTCT TAAAGGGAGG AACTTTTTGT ACTCCAGGGC 7 00 

CTTTTGGTGC AGG AAAGAC C GTTTTACAGC ACCATTTATC TAAGTATGCA 7 50 

GC TGTAGAT A TCGTAGTTTT GTGTGCTTGT GGAGAGCGAG CTGGAGAGGT 800 

TGTAGAAATT CTTCAGGAGT TCCCGCATTT GACAGATCCT CATACGGGGC 850 

50 AGTCTTTGAT GCATAGGACC TGTATTATTT GTAATACATC TTCCATGCCT 900 

GTAGCAGCTA GAGAGTCCTC CATTTATTTG GGTATTACTA TAGCAGAATA 950 

TTACCGTCAA ATGGGGTTGC ATGTTTTGTT ATTGGC TG AC TCGACATCTA 1000 

GATGGGCTCA AGCTTTAAGG GAAATTTCAG GGCGATTAGA AGAAATCCCT 1050 

GGAGAAGAAG CTTTCCCAGC CTATTTGGCG TCTCGAATAG CAGCTTTTTA 1100 

55 TGAGCGAGGC GGGGCTGTGA AAATGAAAGA TGGATCGGAA GGCTCCTTGA 1150 

CTATCTGTGG AGCGGTTTCT CCCGCAGGAG GAAATTTTGA AGAGCC TGTT 12 00 

ACACAAGCAA CTTTATCTGT TGTTGGGGCT TTCTGTGGGC TTTC TAAGGC 1250 

TAGAGCAGAT GCTAGACGGT ATCCTTCTAT TGATCCGATG ATTTCATGGT 13 00 

CTAAGTACTT GGATTCTGTG GCGGAGATTT TAGAGAAAAA AGTTCCAGGA 13 50 

60 TGGGGAGATT CCGTTAAAAA AGCTTCTCGT TTCTTAGAAG AAGGAGCAGA 1400 

607 
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AATTGGTAAG CGAATAGAAG TTGTTGGGGA 
ATATAGAAAT CTTTTTGAAA TCAGAGTTGT 
CAAAACGCTT TCGATGCAGA G G AC TGTT AT 
AGAGCTTTTT TCTTTAATGA GTCATATTTT 
5 ATTGTCCAGA TAATGCTCGG AGTTTCTTTT 
AAAACGCTGA ATGGTCAAAA ATTCCTTTCT 
AGAAGTGATC TATAAACTAT TAGAAAGCAA 
TATGCAAACA ATATATACAA GAATTACGG 



AGAAGGGATT TCTATGGAAG 1450 

ATGATTTCTG TTACTTACAG 1500 

TGTCCTTTTG ATCGTCAAAT 1550 

TAGCTCTAGA TTCTGTTTTG 1600 

TAGAGCTTCA AAGTAAAATT 1650 

GAAGACTATC AGAAGGGGCT 17 00 

AATGGTGCAG ACGGCGTAGG 17 50 

1779 



2 ) INFORMATION FOR SEQ ID NO: 712 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 965 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: V583 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 712 

GTGCAAATTG GAAAAATTGT CAAAGTTTCA GGTCCTTTGA TTTTAGCTGA 50 

AAACATGTCA GATGCTAGTA TCCAAGACAT TTGTCATGTA GGAGATTTAG 100 

30 GCGTTATCGG AGAGATTATT GAAATGCGAG GCGACGTCGC TTCGATTCAA 150 

GTATATGAAG AAACAACAGG CATTGGACCA GGAGAACCAG TTATTTCAAC 2 00 

AGGAGAACCA TTATCTGTTG AATTAGCCCC AGGTTTAATT GCCGAAATGT 2 50 

TTGATGGTAT TCAACGACCA TTGGATACAT TTCAAGAAGT AACCCACAGT 300 

AACTTTTTAG GCCGTGGCGT TAAAATTGAT GCGTTAGATC GTGAGAAAAA 350 

3 5 ATGGACGTTT GAACCAACTG TGGCAGTTGG TGAAGAAGTG TCGGCAGGTG 400 

ACATCGTCGG TGTGGTTCAA GAAACACCGA TTATTCAACA TAAAATTATG 450 

GTGCCTTTCG GCGTTTCAGG AACGATTGCC GAAATTAAAG CAGGTGACTT 500 

TGCCATTGAT GAAACAGTTT ACTCAGTGGA AACGGCTAAA GGAACGGAAA 550 

GTTTTAGCAT GATGC AAAAA TGGCCCGTTC GGCGGGGACG TCCCATTTTA 600 

40 GAAAAACTAA GTCCCAAAGT ACCGATGGTG ACCGGACAAC GCGTAATTGA 650 

TACCTTTTTC CCAATTACGA AAGGCGGAGC GGCAGCAGTT CCAGGACCAT 700 

TTGGCGCTGG AAAAACAGTC GTTCAGCACC AAATTGCTAA GTGGGC CGAT 750 

GTCGACTTAG TCGTTTACGT TGGTTGTGGG GAACGCGGGA ATGAAATGAC 800 

AGATGTTTTA AATGAATTTC CAGAATTAAT TGACCCAACA ACTGGTGAGT 850 

45 CTTTGATGAA TCGGACGATT TTAATTGCGA ATACGTCAAA TATGCCGGTA 900 

GCGGCACGGG AAGCCTCGAT TTATACAGGG ATT AC C ATTG CAGAATATTT 950 

CCGTGATATG GGTTA 965 



50 



60 



2 ) INFORMATION FOR SEQ ID NO: 713 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1737 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

608 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Methanosarcina barker! 

(C) ACCESSION NUMBER: extracted from J04836 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 713 

GTGGAAGTAA AAGGTGAAAT TTATCGTGTG TCTGGGCCTG TCGTCACCGC 50 

CATCGGCTTG CAGGCAAAAA TGTATGACCT GGTCAAAGTC GGTAATGAAG 100 

GTTTAATGGG TGAAGTCATT CAGATATTAG GGCCCAAGAC CATCATCCAG 150 

10 GTATATGAAG AGACCGCAGG TATCAAGCCA GGGGAACCCT GTGTATCTAC 2 00 

AGGGTCGTCT CTGTCCGTAG AACTTGGTCC GGGTCTTCTT TCCAGTATTT 2 50 

ATGACGGGGT TCAAAGGCCT CTGCACGTCC TGC TTGAAAA AATGGGT AG C 300 

TTCATCCAGA GAGGTGTCAG CGCAGATGGG CTTGATCATA AG AAACTC TG 3 50 

GGATTTCAAA CCCATTGTCA AGAAGGGCGA TTCCGTAAAA GGTGGAG AC G 4 00 

15 TAATTGGTGT TGTACAGGAA ACCGTGAATA TTGAACATAA GATCATGGTG 450 

C C TC C TG AT A TCTCAGGTAC AATTTC C G AC ATAAAGAGCG GAAACTTTAC 5 00 

GGTAGTAGAC ACAATCTGTA C TCTGAC TG A TGGGACCGAA TTGCAGATGA 550 

TGCAGAGGTG GCC TGTTCG A AG AC CC AGAC CTGTGAAGGC AAAACTTACT 600 

CCAACCAGGC CTCTGGTTAC AGGAATGAGA ATCCTTGATG GGCTTTTCCC 650 

2 0 TGTGGCAAAA GGCGGAACAG CTGCAATCCC CGGACCTTTC GGATCGGGAA 7 00 

AGACCGTAAC TCAGCAGTCG CTTGCAAAAT GGAGTGATAC CGAAATTGTG 7 50 

GTCTACATCG GTTGTGGTGA GCGTGGAAAC GAAATGGCAG ATGTTCTGAG 800 

CGAATTCCCT GAACTCGAAG ATCCGCAGAC CGGGCGCCCA CTTATGGAGC 850 

GTACTGTTCT TATCGCTAAC ACTTCAAACA TGCCTGTGGC CGCAAGAGAA 900 

25 GCATCTGTGT ATACCGGAAT CACCATTGCA GAATACTACC GTGACATGGG 950 

ATTAGATGTA TCC CTTATGG CAGACTCCAC CTCAAGGTGG GCAGAAGCCA 1000 

TGAGAGAAAT CTCTTCCCGT C TGG AAG AAA TGCCTGGTGA AGAAGGTTAC 1050 

CCAGCATACC TGTCTGCAAG ACTGGCCGAA TTCTACGAGC GTGCCGGGGT 1100 

TGCGGAGAGT CTTTGCGGCG AAACAGGTTC CATTACTGTT ATTGGAGCAG 115 0 

30 TATCTCCACC TGGCGGTGAC TTCTCAGAGC CTGTTACACA GAATACCCTG 12 00 

CGTATCGTAA AAGTGTTCTG GGCTCTCGAT GC C AAACT AT CTCAGAGGCG 1250 

TCACTTCCCG GCCATCAACT GGCTGAACAG TTACAGTCTG TATAAGGACA 13 00 

GTCTTAATGA CTGGTTTGCA GATAATGTGG CTCCTGATTA TGTGCCTTTG 13 50 

AGGGAAAGAG CAATGGAAAT GCTC C AG AC A GAATCTGAAC TGCAGGAAAT 1400 

35 CGTGCAGCTT GTAGGTTCCG ATGCTCTGCC AGACGACCAG CAGCTTCTGC 1450 

TTGAAATCAC CCGTATGCTT AGGGAAATTT TCCTGCAGCA GAATGCATTC 15 00 

CACCCAGTAG ATGCATACAG CCCGTTCGAT CAGCAGTACA AGATCCTTAA 1550 

GGCAATCATG AAATGGGGAG ACGCTGCGAT GGATGCCTTG AAATCAGGTG 16 00 

TTCCCGTAAC TGAAATTATC AAGCTTGAAT CCAAAAATGT GCTTGCTAAG 1650 

40 GTCAAGTACG AAGAGAAGTT TGATGAGTCT ATGAATGCTG TCCTGGCACA 17 00 

GATGGATAAA GAGTTTGCAT CCCTGAGAGG TAGGTAA 17 37 



45 2) INFORMATION FOR SEQ ID NO: 714 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1785 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Methanosarcina jannaschii 

(C) ACCESSION NUMBER : extracted from U67477 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 714 

609 
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ATGAAAAGAG AGGTTGAGAA TATGCCAGTT GTTGGTAAGA TTATTAAAAT 50 

CGCAGGGCCT GTTGTAGTTG CAGAGGGAAT GAAAGGAGCT CAGATGTATG 100 

AGGTCGTTAA AGTAGGAGAA GAGAAATTGA CTGGAGAAAT CATTCAGTTG 150 

CACGATGATA AAGCAGTTAT TCAGGTTTAT GAAGAAACAT CTGGAATTAA 200 

5 ACCAGGAGAG CCAGTTGTTG GTACTGGAGC TCCATTGTCT GTTGAATTAG 2 50 

GGCCAGGGAT GTTAAGAGCT ATGTATGATG GTATTCAGAG GCCTTTAACA 3 00 

GCAATTGAAG AGAAAACAGG TTCAATCTTT ATCCCAAGAG GAGTTGATGT 3 50 

CCCTGCATTA CCAAGAGATA TAAAATGGGA ATTTAAACCA GTGGTAAATG 4 00 

AAGGAGATTA TGTTGAAGAA GGAGACATAA TTGGAAC TGT TGATGAAACT 450 

10 CCTTCAATAG TTCATAAAAT CTTAGTTCCA ATTGGTGTTA AAGGAAAAAT 500 

TGTTGAAATA AAAGAGGGTA AATTTACAGT TGAAGAGACA GTTGCAGTTG 550 

TAGAAACAGA AAATGGAGAA AGGAAAGAAA TTACAATGAT GCAAAAATGG 600 

CCAGTAAGAA AACCAAGACC ATATAAAGAG AAACTACCTC CAGAAATTCC 650 

ATTAATTACA GGGCAAAGAG TTGAAGACAC TTTC TTT AC A TTAGCAAAAG 7 00 

15 GAGGAACAGC AGCAATTCCA GGTCCATTCG GTTCAGGAAA AACGGTTACT 7 50 

CAGCATCAGT TGGCAAAGTG GTCTGACGCT GATGTCGTTG TTTATATCGG 800 

ATGTGGAGAA AGAGGAAACG AGATGACAGA GGTTATTGAA GAGTTCCCAC 850 

ACTTAGAAGA TATTAGAACT GGAAACAAAT TAATGGATAG AACTGTATTA 900 

ATAGCCAACA CATCAAACAT GCCTGTCGCT GCAAGGGAAG CATCTGTCTA 950 

20 TACAGGAATT ACAATTGCAG AGT ACTTC AG AGATATGGGT TATGGAGTTT 1000 

TATTAACAGC AGATTCAACA TCAAGATGGG CAGAGGCAAT GAGAGAAATT 1050 

TCAGGTAGAT TGGAAGAAAT GCCAGGGGAA GAAGGGTATC CAGCATACTT 1100 

AGCTTCAAGA TTGGCTCAGT TCTATGAAAG AGCTGGAAGA GTTATAACCT 1150 

TAGGGAAAGA TAACAGACAA GGATTCGTTT GTATCGTTGG AGCTGTTTCA 1200 

25 CCACCAGGAG GGGACTTCTC AGAACCAGTT ACATCAAACA CACTAAGGAT 1250 

AGTTAAGGTA TTCTGGGCGT TAGATGCAAA CTTGGCAAGA AGAAGACACT 13 00 

TCCCAGCTAT CAACTGGTTG CAGAGTTATT CATTATACAT TGATGATGTT 1350 

ACAGAGTGGT GGAACACAAA TACTGGTCCA GATTGGAGAC AATTAAGAGA 1400 

TGAAGCAATG AGCTTATTAC AAAAAGAGGC AGAGTTGCAA GAGATTGTTC 1450 

3 0 AGTTAGTTGG GCCTGATGCA TTGCCAGATA GGGAGAGAGT TATTTTAGAA 1500 

GTTGCAAGAA TGTTGAGGGA GGATTTCTTA CAGCAAGATG CGTTTGATGA 1550 

GGTAGATACC TACTGTCCTC CAATGAAACA GT AC TTAATG TTAAAGATAA 1600 

TTATGACATT CTACCAAGAA GCATTGAAGG CAGTTGAAAG AGGAGTTGAA 165 0 

CCAGCTAAGA TTTTAGGAGT TTCAGTTAAG CAAGATATTG CAAGAATGAA 17 00 

35 ATACATCCCA CACGATGAGT TTATAAATGT TAAATCAAAA GAAATAATGG 1750 

AGAAAATTAA GAATGAATTA GGTTCATTAA ACTAA 17 85 



40 2 ) INFORMATION FOR SEQ ID NO: 715 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1354 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(B) STRAIN: W83 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 715 



TGAGCGAGGT GATCAAGGTG ATCGGCAAAA ATGCTTATGT GCAGGTTTTC 50 

GAAAGTACTC GCGGTATGCA CGTAGGAGAT GAGGCAGAGT TTACCGGCAG 100 

TATGC TTG AG GTAACGCTCG GCCCCGGTAT GCTTTCGAAG AACTACGACG 150 

GTCTGCAACA CGAC TTGG AC AAGATGGACG GGATCTTCCT CAAACGAGGC 2 00 

60 GATTAT AC TC CCGCTCTCGA TGACGACAAG CTGTGGGACT TCAAGCCTTT 2 50 

610 
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GGCCAATGTG AACGACAATG TGATCGCAGG CTCATGGCTC GGAGAGGTGA 3 00 

CGGAAAATTT CCAACCGCAC AAGATCATGG TACCTTTCGT TTTCGAAGGC 3 50 

AATTACAAGG TGAAGAGTCT GGCCAAAGCC GGTTCGTACA AAGTGAACGA 400 

TGTGATCGCT GTGGTAACGG ATCAGGACGG G AAAG AC C AC AATGTAAC CA 4 50 

5 TGGTGCAGAA ATGGCCGGTG AAACGTGCTA TCACTTGCTA TCGCGAGAAG 500 

CCGCGTCCTT TCAAACTGCT CGAAACGGGT ATCCGTATCA TCG AC AC TTT 550 

CAACCCCATC GTAGAGGGTG GTACGGGATT TATCCCCGGT CCTTTCGGTA 600 

CGGGAAAGAC GGTGCTCCAG CATGCTATCT CGAAGCAGGC GGAAGCCGAT 650 

ATCGTGATCA TTGCAGCCTG TGGCGAGCGT GCAAACGAGG TTGTGGAGAT 700 

10 CTTTGCGGAA TTCCCCCACC TGAATGACCC CCACACGGGA CGTAAATTGA 750 

TGGAACGTAC CATTATTATT GCTAATACGT CGAATATGCC TGTGGCTTCG 800 

CGTGAGGCAT CCGTATATAC GGCCATGACG ATAGCCGAGT ACTATCGCTC 850 

CATGGGCCTT CGCGTGCTGA TGATGGCAGA CTCCACTTCG CGTTGGGCAC 900 

AGGCTCTGCG TGAGATGTCT AACCGTCTGG AAGAGCTTCC CGGACCGGAT 950 

15 GCTTTCCCGA TGGACTTGTC AGCTATCGTA GCCAACTTCT ACGCTCGTGC 1000 

AGGATACGTT TACCTGAACA ACGGTTCGGC CGGTTCGGTA ACGTTCATCG 1050 

GTACGGTATC TCCCGCCGGT GGTAACCTCA AAGAGCCTGT GACGGAAAAC 1100 

ACCAAGAAAG TGGCTCGCTG CTTCTATGCT TTGGAGCAGA ATCGTGCCGA 1150 

CCGCAAACGT TATCCGGCTG TAAACCCCAT CGATAGTTAC TCGAAGTACA 1200 

2 0 TCGAATATCC CGAATTCGAG AGC TAT AT AT CGAACCACAT CAGTTTACTC 12 50 

ATTATATTCA ACAGAAGTTG G C AG ATATAT GGATCAAATC TTACAGCAGA 1300 

CTGGACTACT AAGGTGAATG AGC TGAAGAT GCGCTTGCAT CAGGGTAAAG 1350 

AAAT 1354 



25 



40 



2 ) INFORMATION FOR SEQ ID NO : 716 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 1788 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: Type 4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 716 



TTTGACTCAA GGGAAGATTA TAAAAGTATC GGGACCTCTA GTTATTGCAT 50 

CAGGTATGCA GGAGGCTAAT ATTCAAGATA TTTGCCGTGT AGGTAAGCTA 100 

45 GGGTTAATCG GTGAAATTAT TGAAATGAGA AGAGATCAGG CATCTATCCA 150 

AGTCTATGAA GAAACATCTG GTCTTGGTCC GGGAGAACCT GTTGTTACAA 2 00 

CTGGAGAACC TCTCTCGGTT GAATTAGGGC CAGGATTGAT TTCTCAAATG 25 0 

TTTGATGGCA TACAACGCCC ATTAGATCGA TTTAAATTGG CTACTCATAA 300 

TGATTTTCTA GTTCGTGGGG TAGAAGTTCC AAGTTTGGAT AGAGATATTA 350 

5 0 AGTGGCATTT TGATTCCACT ATAGCAATTG GTCAAAAAGT GAGTACGGGT 400 

GATATTCTTG GAACTGTCAA GGAAACCGAG GTAGTTAATC ATAAAATTAT 450 

GGTTCCTTAT GGAGTATCTG GAGAAGTCGT TTC TATTGC A TCTGGC G ATT 500 

TTACAATTGA TGAAGTTGTA TATGAAATAA AAAAATTGGA CGGTAGTTTC 55 0 

TATAAAGGAA CGCTTATGCA AAAATGGCCT GTCCGCAAGG CGCGTCCTGT 600 

55 TTCTAAACGT TTAATTC C AG AAGAACCATT AATCACAGGT CAACGAGTTA 650 

TTGATGCATT CTTTCCAGTA AC C AAAGGGG GAGCTGCAGC AGTTCCTGGA 7 00 

CCGTTTGGAG CAGGAAAGAC AGTTGTACAA CACCAAGTAG CTAAATTTGC 7 50 

CAATGTTGAT ATTGTTATTT ATGTCGGTTG TGGAGAACGT GGAAATGAAA 800 

TGACGGATGT ACTGAATGAG TTTCCTGAGT TGATTGACCC TAATACCGGA 850 

60 CAATCAATTA TGCAACGGAC AGTTCTGATT GCTAATACTT CAAATATGCC 900 

611 
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TGTTGCTGCT CGTGAGGCTT CAATTTATAC 
ATTTTCGTGA TATGGGCTAC TCTGTCGCCA 
CGTTGGGCAG AAGCGCTACG TGAAATGTCA 
TGGTGATGAG GGTTATCCTG CTTATCTGGG 
5 ATGAAAGAGC AGGACGTTCT CAGGTTCTAG 
ACGATTACTG CTATTGGAGC TGTATCGCCA 
ACCAGTTACT CAAAACACTT T AC GG ATTGT 
ATGCTCCGTT GGCACAGCGA CGTCATTTTC 
TCTTATTCAC TATATAAAGA CAGTGTGGGC 

10 GAAGACAGAT TGGAATAGTA AAATAACTCG 
GGG AATC TAG TTTAGAGGAA ATTGTTCGTC 
TCTGATAATG AACGACTAAC GATGGAAATT 
TTATTTGCAA CAGAACGCTT TTGATTCGGT 
CAAAACAAGA AGCAATGCTA AGTAATATTC 

15 AATCATGCTT TAGAGTTGGG TTCTTACTTT 
CGTGGCAGTT CGAGACCGTA TGGCGAGAAG 
GATTAGATGA AATCAAAATT ATATCAAATG 
TTGATATTAG AAACAGGAGG TCTATAAATG 



AGGAATTACC ATGGCTGAGT 950 

TTATGGCTGA TTCAACTTCA 1000 

GGACGTCTAG AAGAAATGCC 1050 

AAGTCGTATC GCTGAATATT 1100 

GGCTTCCAGA ACGTGAAGGA 1150 

CCTGGTGGAG ATATTTCAGA 12 00 

GAAAGTTTTT TGGGGGCTTG 12 50 

CTGCAATTAA CTGGCTTACA 1300 

ACTTATATAG ATGGTAAAGA 13 50 

TGCGATGAAC TACTTACAAC 14 00 

TTGTTGGAAT TGATTCTCTG 1450 

GC TAAAC AAA TTCGAGAAGA 15 00 

AGATACATTC ACTTCGTTTG 1550 

TCACTTTTGC TGATCAGGCA 1600 

ACAGAGATTA TGGAAGGTAC 1650 

TAAATATGTT TCAGAAGATA 17 00 

AGATTACACA TC AAATTC AT 17 50 

AGTGTTAT 17 88 



20 



35 



2 ) INFORMATION FOR SEQ ID NO: 717 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Burkholderia mallei 

(B) STRAIN: GB8 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 717 



CGATCCTGGT GTGCTCGGCC GCTGACGGCC CGATGCCGCA AACGCGTGAG 50 

CACATCCTGC TGGCGCGTCA GGTCGGTGTG CCGTACATCA TCGTGTTCCT 100 

40 GAACAAGTGC GACATGGTGG ACGACGCGGA GCTGCTCGAG C TGGTC G AAA 150 

TGGAAGTGCG CGAACTGCTG TCGAAGTACG ACTTCCCGGG CGACGACACG 2 00 

CCGATCATCA AGGGTTCGGC GAAGCTGGCG CTGGAAGGCG ACAAGGGCGA 2 50 

GCTGGGCGAA GTGGCGATCA TGAACCTGGC CGACGCGCTG GACACGTACA 3 00 

TCCCGACGCC GGAGCGTGCG GTCGACGGCG CGTTCCTGAT GCCGGTGGAA 35 0 

45 GACGTGTTCT CGATCTCGGG CCGTGGTACG GTGGTGACGG GTCGTGTCGA 400 

GCGCGGCGTG ATCAAGGTTG GCGAGGAAAT CGAAATCGTC GGTATCAAGG 45 0 

CGACGGCGAA GACGACCTGC ACGGGCGTGG AAATGTTCCG CAAGCTGCTG 500 

GACCAGGGTC AGGCGGGCGA CAACGTCGGT ATCCTGCTGC GCGGCACAAA 550 

GCGTGAAGAC GTGGAGCGCG GCCAGGTTCT GGCGAAGCCG GGTTCGATCA 600 

50 CGCCGCACAC GCACTTCACG GCAGAAGTGT ACGTGCTGAG CAAGGACGAA 65 0 

GGCGGCCGCC ACACGCCGTT CTTCAACAAC TACCGTCCGC AGTTCTACTT 7 00 

CCGTACGACG G ACGTGAC GG GCTCGATCGA GCTGCCGAAG GACAAGGAAA 750 

TGGTGATGCC GGGCGACAAC GTGTCGATCA CGGTGAAGCT GATCGCGCCG 800 

ATCGCGATGG AAGAAGGTCT GCG 823 

2 ) INFORMATION FOR SEQ ID NO: 718 

60 (i) SEQUENCE CHARACTERISTICS: 

612 
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(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Burkholderia pseudomallei 
10 (B) STRAIN: 1026B 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 718 

GCGATCCTGG TGTGCTCGGC CGCTGACGGC CCGATGCCGC AAACGCGTGA 50 

15 GCACATCCTG CTGGCGCGTC AGGTCGGTGT GCCGTACATC ATCGTGTTCC 100 

TGAACAAGTG CGACATGGTG GACGACGCGG AGCTGCTCGA GCTGGTCGAA 150 

ATGGAAGTGC GCGAACTGCT GTCGAAGTAC GACTTCCCGG GCGACGACAC 200 

GCCGATCATC AAGGGTTCGG CGAAGCTGGC GCTGGAAGGC GACAAGGGCG 2 50 

AGCTGGGCGA AGTGGCGATC ATGAACCTGG CCGACGCGCT GG AC AC GT AC 3 00 

2 0 ATCCCGACGC CGGAGCGTGC GGTCGATGGC GCGTTCCTGA TGCCGGTGGA 350 

AGACGTGTTC TCGATCTCGG GCCGTGGTAC GGTGGTGACG GGTCGTGTCG 400 

AGCGCGGCGT GATCAAGGTT GGCGAGGAAA TCGAAATCGT CGGTATCAAG 45 0 

GCGACGGCGA AGACGACCTG CACGGGCGTG GAAATGTTCC GCAAGCTGCT 500 

GGATCAGGGT CAGGCGGGCG ACAACGTCGG TATCCTGCTG CGCGGCACGA 550 

2 5 AGCGTGAAGA CGTGGAGCGC GGCCAGGTTC TGGCGAAGCC GGGTTCGATC 600 

ACGCCGCACA CGCACTTCAC GGCTGAAGTG TACGTGCTGA GC AAGGACGA 650 

AGGCGGCCGC CACACGCCGT TCTTCAACAA CTACCGTCCG CAGTTCTACT 7 00 

TCCGTACGAC GG AC GTGACG GGCTCGATCG AGCTGCCGAA GGACAAGGAA 750 

ATGGTGATGC CGGGCGACAA CGTGTCGATC AC GGTG AAGC TGATCGCGCC 800 

30 GATCGCGATG GAAGAAGGTC TGCG 824 



35 



2) INFORMATION FOR SEQ ID NO: 719 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 800 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Clostridium beijerincki (deposited as 

Clostridium butyricum) 
(B) STRAIN: ATCC 8260 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 719 



TGTATCAGCA GCAGATGGTC CAATGCCACA AACAAGAGAA CATATACTAC 50 

TAGGATCAAG AGTTGGTATC CAATATATCG TAGTATTCTT AAATAAAGCA 100 

GATATGGTAG ACGATCCAGA ATTATTAGAA TTAGTAGAAA TGGAAGTAAG 150 

AGAATTATTA AGCGAATATG ACTTCCCAGG AGACGATATT CCAGTAATAA 200 

55 CAGGATCAGC ATTAAAAGCA TTAGAAAATC CAACAGATGA AGAAGCAATT 250 

AAGCCAATCA TGGATTTAAT GG AAGCAGTA GATAGCTATA TCCCAACTCC 3 00 

AGAAAGAGCA ACAGATAAGC CATTCTTAAT GCCAATCGAA GATGTATTCA 3 50 

CAATTACAGG AAGAGGAACA GTTGCAACAG GAAGAGTTGA AGCTGGAGTA 400 

CTTCATGTAG GAGATGAAGT AGAAATCGTT GGATTAACAG AAGAAAAGAA 450 

60 GAAAGTTGTA GTAACTGGAA TCGAAATGTT CAGAAAGTTA TTGGATGAAG 500 

613 
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CACAAGCTGG AGATAACATC GGAGCATTAT 
GATATTGAAA GAGGTCAAGT TTTATCAAAA 
C AC T AAATTT GTAGGTCAAG TATACGTACT 
GACATACTCC ATTCTTTGAT GGATACAGAC 
5 AC AG AC GTT A CAGGRTCAAT CAAGTTACCA 
GCCTGGAGAT CACATTGATA TGAATGTTGA 



TAAGAGGAGT TCAAAGAACT 550 

CCAAATTCAG TACACCCTCA 600 

TAAAAAAGAA GAAGGTGGAA 650 

CACAATTCTA TTTCAGAACA 7 00 

GATGGAATGG AAATGGTAAT 7 50 

ATTAATCACT CCAATCGCAA 800 



10 2 ) INFORMATION FOR SEQ ID NO: 720 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 799 bases 

(B) TYPE : Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium innocuum 

(B) STRAIN: ATCC 14501 



25 



30 



35 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 720 

GGTGCTATCC TGGTTGTTGC TGCATCTGAT GGTCCTATGC 
TGAGCACATC CTGCTTGCTC GTCAGGTAGG TGTTCCTTAC 
TCCTGAACAA ATGCGACATG GTTGATGACG AAGAACTGAT 
GAAATGGAAG TACGTGAGCT GTTAAGC GAG TACGGATTCG 



CTCAGACTCG 


50 


ATCGTTGTAT 


100 


CGACCTTGTT 


150 


ACGGAGATAA 


200 


GGTGACGACA 


250 


TGAATTCATC 


300 


CTGTAGAAGA 


350 


CGTGTTGAGC 


400 


TATCAAGGAT 


450 


AGCAGCTGGA 


500 


GGTATCAACC 


550 


ATCCGTACAT 


600 


AAGAAGAAGG 


650 


TTCTACTTCC 


700 


AACTGAAATG 


750 


TTGCTCCAA 


799 



45 2 ) INFORMATION FOR SEQ ID NO : 721 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 789 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

ID) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium novyi 

(B) STRAIN: ATCC 19402 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 721 

614 
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TTGTATCAGC AGCAGATGGT CCAATGCCAC AAACAAGAGA ACACATCCTA 50 

TTAGCATCAA GAGTAGGAGT TAACCACATA GTAGTATTCT TAAACAAAGC 100 

AGACCAAGTA GATGATCCAG AATTAC TAG A ATTAGTAGAA ATGGAAGTAA 150 

GAGAATTATT AAGCGAATAC GGATTTGACG GAGACGAATG TCCAGTAGTA 2 00 

5 GTAGGATCAG CATTAAAAGC AATCGAAGAA GGGGATGACC AATGCATCCT 250 

AG AC TTAATG AAAGC TGTAG ATGAATATAT CCCAACTCCA GAAAGAGCAA 3 00 

CAGATCAACC ATTC TTAATG CCTGTAGAAG ATGTATTTAC AATTACAGGA 350 

AGAGGAACAG TTGCAACAGG AAGAGTTGAA AGAGGAGTAC T AC AC GT AGG 4 00 

AGATGAAGTA CAAATCGTAG GAATGAAAGA AG AAATC GGA AAGACAACAA 450 

10 TCACAGGAGT AGAAATGTTC AGAAAGATGT TAGATGAAGC AATGGCTGGA 500 

GATAACATCG GAGCATTATT AAGAGGAGTA CAAAGAGACG AAATCGAAAG 550 

AGGTCAAGTA CTAGCAAAAC CAGGTTCAGT AACACCTCAC AAAAAATTCG 600 

TAGGTCAAGT TTACGTATTA AAGAAAGAAG AAGGTGGAAG ACACACTCCA 650 

TTCTTTAACG GATACAGACC ACAATTCTAC TTCAGAACAA CAGACGTAAC 700 

15 AGGATCAATC GCTTTACCAG AAGGAGTAGA AATGGTAATG C C AGG AG AC C 750 

ATATAGACAT GAACGTAGAA TTAATCACAC CAGTAGCAA 789 



20 2 ) INFORMATION FOR SEQ ID NO: 722 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium septicum 

(B) STRAIN: ATCC 12464 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 722 



GTTCAGCAGC AGACGGTCCA ATGCCACAAA CAAGAGAACA TATACTACTA 5 0 

GCATCAAGAG TTGGTGTTGA CTATATCGTA GTATTCTTAA ACAAGGCAGA 100 

TATGGTAGAT GACGAAGAAT TATTAGAATT AGTAGAAATG GAAGTTAGAG 150 

AATTATTATC AGAATACAAC TTCC C AGG AG ATG AT ATTC C AGTAATCAAG 200 

40 GGATCAGCTT TAGTAGCATT AGAAAACCCA ACAGATGAAA AATCAATCGC 250 

TCCAATCTTA GAATTAATGG AAGCAGTAGA TAGCTACATT CCAACACCAG 3 00 

AAAGAGCAAC AGATAAGCCA TTCTTAATGC CAGTAGAAGA TGTATTCACA 3 50 

ATAACTGGTA GAGGAACAGT TGCAACAGGA AGAGTTGAAA GAGGAGTTCT 400 

TCATGTAGGA GACGAAGTAG AAATCGTTGG ATTATCAGAA GAAAGCAGAA 450 

45 AAGTAGTAGT AACAGGAATA GAAATGTTCA GAAAGTTACT AGACGAAG C A 500 

CAAGCTGGAG ATAATGTTGG AGTACTTTTA AGAGGTGTTC AAAGAACAGA 550 

TATCGAAAGA GGTCAAGTAT TAGCAAAGAC TGGATCAGTT AAGCCACACA 600 

GCAAGTTCGT AGGTCAAGTA TACGTACTTA AGAAAGAAGA AGGTGGAAGA 650 

CATACTCCAT TCTTCGATGG ATACAGACCA CAATTCTACT TCAGAACAAC 7 00 

50 AG ACGTTAC T GGATCAATCA AATTACCAGA CGGAATGGAA ATGGTTATGC 75 0 

CAGGAGACCA CATTGATATG AACGTTGAAT TAATCACTCA AGTAGCAA 798 



55 2 ) INFORMATION FOR SEQ ID NO : 723 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

615 
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(D) TOPOLOGY: Linear 



10 



15 



20 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Clostridium tertlum 

(B) STRAIN: ATCC 14573 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 723 

GTTCAGCAGC AGATGGTCCA ATGCCTCAAA CAAGAGAACA 
GCTTCAAGAG TTGGTGTTGA CTACATAGTT GTTTTCTTAA 
TATGGTAGAT GACGAAGAAT TATTAGAATT AGTTGAAATG 
AATTATTATC AGAATACAAC TTCCCAGGAG ATGATATTCC 
GGTTCAGCTT TACAAGCATT AGAAAACCCA ACAGATGAAA 
TCCAATCCTT GAG TT AATGG AAGCTGTAGA TAGCTACATT 
AAAGAGCAAC AG ATAAGC C A TTC TTAATGC CAGTAGAAGA 
ATC AC TGGTA GAGGAACAGT TGCTACAGGA AGAGTTGAAA 
TCACGTAGGA GACGAAGTAG AAATCGTTGG ATTATCAGAA 
AAGTAGTAGT AACAGGAATA GAAATGTTCA GAAAGTTACT 
CAAGCTGGAG ACAACGTAGG AGTTCTTTTA AGAGGAGTTC 
CATCGAAAGA GGTCAAGTTT TAGCAAAAGT TGGATCAGTT 
AGAAATTTGT AGGTCAAGTA TACGTACTTA AAAAAGAAGA 
CATACTCCAT TCTTCGATGG ATACAGACCA CAATTCTACT 
AGAYGTTACT GGTTCAATCA AGTTACCAGA TGGAATGGAA 
CAGGAGACCA CATTGATATG AACGTTGAAT TAATCACTCA 



CATACTACTA 50 

ACAAGGCAGA 100 

GAAGTAAGAG 150 

AGTAATAAAG 200 

AAGCAATCGC 250 

CCAACTCCAG 3 00 

TGTATTCACA 350 

GAGGAGTTCT 400 

GACAGCAGAA 45 0 

AG AC G AAGCG 500 

AAAGAACTGA 550 

AAGCCACACA 600 

AGGTGGAAGA 65 0 

TCAGAACAAC 7 00 

ATGGTTATGC 7 5 0 

AGTAGCTAT 799 



30 2 ) INFORMATION FOR SEQ ID NO: 724 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tetani 

(B) STRAIN: ATCC 19406 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 724 



TAGTAAGTGC AGCAGATGGT CCAATGCCAC AAACAAGAGA ACACATACTA 50 

TTAGCATCCA GAGTTGGAGT TGAGCACATA GTAGTATTCT TAAATAAAGC 100 

AGACCAAGTA GATGACGCAG AGTTAATCGA ATTAGTAGAA ATGGAAGTAA 150 

GGGAATTAAT GAACGAATAC GGATTCCCAG GAGATGACGC ACCAGTAGTA 200 

50 GTAGGATCCG CATTAAAAGC ATTAGAAAAT CCAGAAGATG ATGCAGCAAC 250 

ACAATGCATA ATGG AC TTAA TGGCAGCAGT AGATGAATAT ATACCAACAC 3 00 

CAGAAAGAGC AACAGATAAG CCATTCTTAA TGCCAGTAGA AG AT ATC TTC 3 50 

ACAATCACAG GAAGAGGAAC AGTTGCAACA GGAAGAGTAG AAAGAGGAAT 400 

TCTAAAAGTA GGAGACGAAA TAGAAATCGT AGGATTAAGT GATGAAAGCA 450 

55 AGAAATCAGT AATCACAGGA ATAGAAATGT TCAGAAAACT ATTAGATGAA 5 00 

GCACAAGCAG GAGATAACAT CGGAGCATTA TTAAGAGGTG TTC AAAG AG A 550 

TGAAATCCAA AGAGGTCAAG TATTAGCAGC AACAGGATCA GTAAAAC C AC 600 

ATAAGAGTTT TACAGGTCAA GTATATGTAT TAAAGAAAGA AGAAGGAGGA 650 

AGACACACTC CATTCTTTAA CGGATACAGA CCACAATTCT ACTTTAGAAC 700 

60 AACAGACGTA ACAGGTTCAA TCGCACTACC AGAAGGAGTA GAAATGGTAA 7 50 

616 
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15 



45 



TGCCAGGAGA CCACATAGAC ATGAAGGTAG AATTAATAAC AAGAGTAGCA 8 00 

A 801 



2 ) INFORMATION FOR SEQ ID NO: 72 5 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 633 bases • 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus malodoratus 

(B) STRAIN: ATCC 43197 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 5 

TATGCCTCAA AC TCGTG AAC ACATCTTGTT ATCTCGTAAC GTTGGTGTTC 50 

CTTACATCGT TGTATTCTTA AACAAAATGG ATATGGTTGA TGATGAAGAA 100 

TTACTAGAAT TAGTTGAAAT GGAAGTTCGT GACTTATTGT C AG AATAC G A 150 

25 CTTCCCAGGC GACGACACTC CAGTTATCGC TGGTTCAGCT TTGAAAGCTT 2 00 

TAGAAGGCGA TGCTTCATAC GAAGAAAAAA TCTTAGAATT AATGGCTGCT 2 50 

GTTGATGAAT ATATCCCAAC ACCAGTTCGT GATACTGACA AACCATTCAT 3 00 

GATGCCAGTC GAAGATGTAT TCTCAATCAC TGGTCGTGGA AC TGTTGC AA 350 

CTGGTCGTGT TGAACGTGGA CAAGTTCGCG TTGGTG AC G A AGTTGAAATC 400 

30 GTTGGTATTG CTGAAGCAAC TGCTAAAACA AC TGTT AC AG GTGTTGAAAT 450 

GTTCCGTAAA TTGTTAGATT ACGCTGAAGC AGGCGATAAC ATTGGTGCAT 5 00 

TGTTACGTGG TGTTGCACGT GAAGACATCC AACGTGGACA AGTATTGGCT 5 50 

AAACCAGCTT CAATCACTCC ACATACAAAA TTCTCTGCAG AAGTTTACGT 600 

TTTAACTAAA GAAGAAGGCG GACGTCATAC TCC 633 

35 

2 ) INFORMATION FOR SEQ ID NO: 72 6 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 623 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enter o coccus sulfurevs 
50 (B). STRAIN: ATCC 49903 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 726 

CACGTGAACA CATCTTGTTA TCTCGTAACG TAGGTGTTCC TTACATCGTT 50 

55 GTATTCTTAA ACAAAATGGA TATGGTTGAT GACGAAGAAT TATTAGAATT 100 

AGTAGAAATG GAAGTTCGTG ACTTATTATC AGAATACGAT TTCCCAGGCG 150 

ATGACACTCC AGTTGTTGCA GGTTCTGCTT TGAAAGCTTT AG AAGG CG AC 2 00 

GCTTCTTACG AAGAAAAAAT CATGGAATTA ATGGCTGCAG TTGACGAGTA 2 50 

CATCCCAACT CCAACTCGTG AC AC TG AC AA ACCATTCATG ATGCCAGTTG 300 

60 AGGATGTATT CTCAATCACT GGACGTGGTA CTGTTGCTAC AGGTCGTGTT 3 50 

617 
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GAACGTGGAC AAGTTCGCGT TGGTGACGTT 
TGACGAAACT GCTCAAACAA CTGTAACAGG 
TATTAGACTA CGCTGAAGCA GGCGATAACA 
GTTGCTCGTG AAGACATCCA ACGTGGACAA 
5 AATCACTCCA CATACAAAAT TC TCTGCTG A 
AAGAAGGTGG AC GTC AT ACT CCA 



GTAGATATCG TTGGTATCGC 4 00 

TGTTGAAATG TTCCGTAAAT 450 
TCGGTGCTTT ATTACGTGGT 5 00 

GTTTTAGCTA AACCAGCTTC 5 50 

AGTATACGTA TTAAGCAAAG 600 

623 



10 2 ) INFORMATION FOR SEQ ID NO : 727 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 646 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus garvieae 

(B) STRAIN: ATCC 49156 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 727 

25 



CGGTCCTATG CCTCAAACTC GTGAACACAT CTTGTTATCA CGTAACGTTG 50 

GCGTACCATA CATCGTTGTT TTCTTGAACA AAATGGATAT GGTTGATGAC 100 

GAAGAATTGC TAGAATTAGT TGAAATGGAA GTTCGTGACC TATTGTCTGA 150 

AT ATGAC TTC CCAGGCGACG ATGTTC CTGT AATCGCTGGT TCTGCTTTGA 200 

3 0 AAGCTCTTGA AGGAGATCCT TCATACGAAG AAAAAATCAT GGAATTGATG 250 

GCTGCAGTTG ACGAATACGT TCCAACTCCA GAACGTGATA CTGACAAACC * 300 

ATTCATGATG CCAGTCGAAG ACGTATTCTC AATCACTGGA CGTGGTACTG 350 

TTGCTACAGG CCGTGTTGAA CGTGGACAAG TTCGCGTTGG TGATGAAGTA 400 

GAAATCGTTG GTATTGCTGA CGAAACTGCT AAAACAACTG TAACAGGTGT 450 

3 5 TGAAATGTTC CGTAAATTGT TAGACTATGC TGAAGCAGGG GATAACATTG 500 

GTGCATTGCT ACGTGGGGTT GCTCGTGAAG ACATCCAACG TGGACAAGTA 550 

TTGGCTAAAG CTGGTACAAT CACACCTCAT ACAAAATTCA AAGCTGAAGT 600 

TTATGTTTTG ACAAAAGAAG AAGGTGGACG TCACACTCCA TTCTTC 646 

40 

2) INFORMATION FOR SEQ ID NO: 728 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 823 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycoplasma pirum 

(B) STRAIN: ATCC 25960D 



55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 28 

GAGCAATTTT AGTTGTTTCT GCAACTGATG GTC C AATGCC TCAAACTCGT 50 

GAACATATCT TATTAGCACG CCAAGTTGGT GTTCCTAAAA TGGTTGTTTT 100 

CTTAAACAAA TGTGATGTTG CTTCTGATCC AGAAATGCAA GAATTAGTTG 150 

60 CTGAAGAAGT AAAAG AC TTA TTAAAATCTT ATGGTTTTGA TGGTGACAAT 200 

618 
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ACTCCAATTA TTCGTGGTTC AGCATTAGAA GCATTAAATG GTAAACCTGA 2 50 

ATGAGAAGAA AAAATTAAAG AATTAATGAA GGCAGTGGAT GACACTATTC 3 00 

CTGATCCAGT TCGTGATACT GAAAAGCCAT TCTTGTTACC AATTGAAGAC 350 

GTAATGACAA TTACAGGTCG TGGTACTGTT GTTACAGGTC GTGTAGAACG 400 

5 TGGTACTCTA AAATTAAATG ATGAAGTTGA AATTGTTGGT TTAGGTGAAA 450 

CATTTAAATC TG TTGTAAC A GGTATTGAAA TGTTCCGTAA AGAATTAGAT 5 00 

GAAGCTCGTG CTGGTGACAA TGCTGGTATT TTACTTCGTG GTGTTGACCG 550 

TGGTCAAGTA CAACGTGGTC AAGTTCTTGC TAAACCAGGT TCTATTACTC 600 

CTCATACTAA ATTTAAAGCT GAAATTTATG CTTTGAAAAA AGAAGAAGGT 6 50 

10 GGTCGTCATA CTGCTTTCTT AAACGGTTAT CGTCCTCAAT TCTATTTCAG 7 00 

AACAACTGAT GTTACAGGTT CTATTAAATT AAAAGATGGA ACTGAAATGG 7 50 

TTATGCCTGG TGACAATACT GAAATCACTG TAGAATTAAT TTCACCAATT 800 

GCTTGTAAAA AGGAAGTAAG TTT 823 



15 



30 



2 ) INFORMATION FOR SEQ ID NO: 729 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycoplasma salivarlum 

(B) STRAIN: ATCC 23 064 



-Cxi")— SEQUENCE—DESCRIPTION-: - SEQ -I-B-N0-:— 729 



GGAGCAATCT TAGTTGTTGC TGCAACTGAT GGTGCGATGC CTCAAACTCG 50 

TGAACACGTT TTACTTGCAA AACAAGTTGG TGTTC CT AAA ATCGTTGTTT 100 

35 TCTTAAACAA AATCGATATG TTCAAGCCAG AAGAAAGAGC CGAAATGGTT 150 

GAAATGGTTG AAATGGACAT TCGTGACTTA TTAAACAAAT ACGACTTTGA 2 00 

TGGTGACAAT ACTCCTATTA TTGC TGGTTC AGCATTAAAA GCATTACAAG 2 50 

GTGATCCAGA ATATGAAAAG AATATTC TAG AACTTATGGA TGCAGTAGAC 3 00 

ACATACATTG ATGAACCTAC TCGTGATAAA GATAAACCAT TCTTAATGGC 3 50 

40 TGTAGAAGAC GTATTTACAA TTACTGGTCG TGGAACTGTT GCTACTGGTA 400 

GAGTAGAACG TGGTACATTA CATCTAAATG ATGAAGTTGA AATCGTTGGT 450 

CTACGTCCAA CAATTAAAAC TGTTGTTACT GGAATTGAAA TGTTCCGTAA 5 00 

AAATTTAAAA GAAGCTCAAG CTGGAGATAA TGCAGGACTT TTACTACGTG 5 50 

GAATTGATAG AGATCAAGTA GAAC GTGG AC AAGTTTTGGC CAAACCAAAA 600 

45 AGTATTATTC CTCACACAGA ATTTGAAGCT GCTGTGTATG TTC TAAAAGC 650 

TGAAGAAGGT GGACGTCACA CTCCATTCTT TGAACACTAT AAACCACAAT 7 00 

TTTACTTTCG TACAACCGAC GTTACTGGTG GAATTAAATT CAAACCTGGA 7 50 

CGTGAAATGG TTATGCCTGG CGAAAATGTT GAATTTACAG TTACTTTAAT 800 

TGCTCCTATT GCAGTTGAAG AAGGAA 826 

50 

2 ) INFORMATION FOR SEQ ID NO: 730 

5 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 



619 
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(ii) MOLECULE TYPE; Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria polysaccharea 

(B) STRAIN: ATCC 43768 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 730 

TGGTATGTTC CGCAGCYGAY GGYCCTATGC CTCAAACTCG CGAACACATC 5 0 

CTGYTGGCTC GCCAAGTAGG YGTACCTTAC ATCATCGTRT TCATGAACAA 100 

ATGCGACATG GTTGACGATG CCGAGCTGTT GGAACTGGTT GAAATGGAAA 150 

TCCGYGACCT GYTGTCMAGC TACGACTTCC CMGGCGACGA CTGCCCAATC 2 00 

GTACAAGGTT CTGCACTGAA AGCTTTGGAA GGYGAYGCTG GTTACGAAGA 2 50 

GAAAATCTTC GAAYTGGCTG CTGCTTTGGA CAGCTACATC CCAACTCCTG 3 00 

AGCGTGCTGT GGACAAACCK TTCYTGYTGC CTATCGAAGA CGTRTTCTCT 3 50 

ATCTCTGGYC GYGGTACMGT AGTAACYGGY CGTGTAGAGC GC GGTRTC AT 400 

CCACGTTGGT GACGAGATYG AAATCGTAGG TCTGAAAGAA ACCCAAAAAA 450 

CCACTTGTAC CGGCGTTGAA ATGTTCCGCA AACTGCTGGA CGAAGGTCAA 500 

GCWGGTGACA ACGTAGGCGT ATTGYTGCGT GGTACCAARC GTGAAGACGT 550 

AGAGCGTGGT CAAGTATTGG CYAAACCAGG YACCATYACT CCSCACACCA 600 

AATTCAAAGC AGAAGTATAC GTACTGAGCA AAGAAGAGGG TGGTCGYCAC 650 

ACTCCATTCT TCGCYAACTA CCGYCCMCAA TTCTACTTCC GTACYACTGA 7 00 

CGTAACYGGT GCAGTTACTT TGGAAGAAGG TGTAGAAATG GTAATGCCAG 750 

GTGAGAACGT AACCATTACY GTAGAAC TGA TTGCGCCTAT CGCTAGGAAG 800 

AAGGTCTGCG 810 



2 ) INFORMATION FOR SEQ ID NO : 7 31 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 bases 

(B) TYPE: . Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Enteritidis 

(B) STRAIN: ATCC 13076 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 731 

CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC GTGAGCACAT 
CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG TTCCTGAACA 
AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT TGAAATGGAA 
GTTCGYGAAC TGCTGTCTCA GTACGACTTC CCGGGCGACG ACACTCCGAT 
CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA GAGTGGGAAG 
CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT CCCGGAACCA 
G AGCGTGC G A TTGACAAGCC GTTCCTGCTG CCGATCGAAG ACGTATTCTC 
CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA CGCGGTATCA 
TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA GACTCAGAAG 
TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG ACGAAGGCCG 
TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA C GTG AAG AAA 
TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA GCCGCACACC 
AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG GCGGCCGTCA 
CACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC CGTACTACTG 
ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT GGTAATGCCG 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
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GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA TCGCAATGGA 800 
CGACGGTCTG CGT 813 



2 ) INFORMATION FOR SEQ ID NO : 732 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 812 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Gallinarum 

(B) STRAIN: ATCC 9184 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 732 



CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC GTGAGCACAT 50 

CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG TTCCTGAACA 100 

25 AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT TGAAATGGAA 150 

GTTCGYGAAC TGCTGTCTCA GTACGACTTC CCGGGCGACG ACACTCCGAT 2 00 

CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA GAGTGGGAAG 250 

CG AAAATC AT CGAACTGGCT GGCTTCC TGG ATTCTTACAT CCCGGAACCA 300 

GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG ACGTATTCTC 3 50 

30 CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA CGCGGTATCA 400 

TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA GACTCAGAAG 450 

TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG ACGAAGGCCG 500 

TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA CGTGAAGAAA 550 

TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA GCCGCACACC 600 

3 5 AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG GCGGCCGTCA 6 50 

CACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC CGTACTACTG 7 00 

ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT GGTAATGCCG 750 

GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA TCGCAATGGA 800 

CGACGGTCTG CG 812 

40 

2 ) INFORMATION FOR SEQ ID NO: 733 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
55 serotype Paratyphi B 

(B) STRAIN: ATCC 87 59 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 3 

60 TGGTTGTTGC TGCGACTGAC GGYCCGATGC CGCAGACCCG TGAGCACATC 50 

621 



CA 02307010 2000-05-19 



CTGCTGGGTC GTCAGGTAGG CGTTC CGTAC ATCATCGTGT TCCTGAACAA 100 

ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT GAAATGGAAG 150 

TTCGYGAACT GCTGTCTCAG TACGACTTCC CGGGCGACGA CACTCCGATC 2 00 

GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG AGTGGGAAGC 2 50 

5 GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATT CCGGAACCAG 300 

AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA CGTATTCTCC 3 50 

ATCTCTGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC GCGGTATCAT 400 

CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG ACTCAGAAGT 4 50 

CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA CGAAGGCCGT 500 

10 GCYGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC GTGAAGAAAT 550 

CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAG CCGCACACCA 600 

AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG CGGCCGTCAT 650 

ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC GTACTACTGA 7 00 

CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG GTAATGCCGG 750 

15 GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT CGCAATGGAC 800 

GACGGTCTGC GTTT 814 



20 2 ) INFORMATION FOR SEQ ID NO: 734 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 8 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) — ORGANISM: Salmonella -choleraesuis- subsp . choleraesuli 

serotype Virchow 
(B) STRAIN: ATCC 51955 

3 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 734 

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 15 0 

40 GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGACTTCC CGGGCGACGA 2 00 

CACTCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 2 50 

AGTGGGAAGC GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATT 3 00 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 3 50 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC 400 

45 GCGGTATCAT CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG 450 

ACTCAGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCCGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAG 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG 650 

50 CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 7 00 

GTACTACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCAATGGAC GACGGTCTGC GTTTCGCA 82 8 

55 

2 ) INFORMATION FOR SEQ ID NO : 735 

(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 825 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia grimes ii 

(B) STRAIN: ATCC 144 60 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 735 



GGCGCTATCC TGGTTGTTGC TGCGACTGAT GGCCCAATGC CACAGACCCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTTGG CGTTCCTTTC ATCATCGTAT 100 

15 TCATGAACAA ATGCGACATG GTTGATGATG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TTCGTGAACT TCTGTCTGCT TATGACTTCC CTGGTGATGA 200 

CCTGCCAGTT GTTCGTGGTT CAGCGCTGAA AGCACTGGAA GGCGAAGCTG 250 

AGTGGGAAGC TAAAATCATC GAACTGGCTG GCTACCTGGA TTCTTACATC 3 00 

CCAGAACCAG AGCGTGCTAT CGACAAGCCG TTCCTGCTGC CAATCGAAGA 3 50 

2 0 CGTATTCTCC ATCTCCGGYC GTGGTACYGT AGTTACCGGT CGTGTAGAGC 400 

GCGGTATCGT TAAAGTTGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAC 450 

ACCGTTAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAGC 550 

GTGAAGACAT CGAACGTGGT CAGGTACTGG CTAAACCAGG TTCAATCAAG 600 

25 CCACACACCA AATTCGACTC AGAAGTTTAC ATCCTGAGCA AAGAAGAAGG 650 

TGGTCGTCAC ACTCCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 7 00 

GTACAACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTAGAGATG 7 50 

GTAATGCCAG GCGATAACGT GAACATGGTT GTAACCCTGA TTCACCCAAT 800 

CGCGATGGAC GACGGTCTGC GTTTC 82 5 

30 



2 ) INFORMATION FOR SEQ ID NO: 736 



35 



40 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 98 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 736 



TATTAGTTTG 
50 ATACTATTAT 
CAAATGTGAC 
AAGTAAGAGA 
ATAGTAAGAG 
GGGAGATAAG 
55 CTCCAGAGAG 
TTCTCAATCA 
AGTACTAAAA 
CAAGAAAAGT 
CAAGCACAAG 
60 AAACGAGATA 



TTCAGCAACA 
CAAGACAAGT 
ATGGTAGATG 
TTTATTAACA 
GTTCAGCATT 
ATAGTAGAAT 
AGATACAGAT 
CAGGAAGAGG 
GTACAAGACG 
AGTAGTAACA 
CAGGGGATAA 
GAAAGAGGAC 



GATGGACCAA 
TGGAGTACCA 
ATGAAGAGTT 
GAATATGATT 
AATGGCATTA 
TATTCGAGCA 
AAACCATTCT 
AACAGTTGCA 
AAGTAGAKTT 
GGAGTAGAGA 
TATAGGAGCA 
AAGT AC TAGC 



TGCCACAAAC 
TATATAGTAG 
ATTAGAGTTA 
TCCCAGGAGA 
GAAGATCCAA 
AATAGATGAG 
TAATGCCAGT 
ACAGGAAGAG 
AGTAGGATTA 
TGTTCAGAAA 
TTATTAAGAG 
AAAGACTGGA 



AAGAGAGCAT 50 

TATTCTTAAA 100 

GTAGAGATGG 150 

TGACACTCCA 200 

AGAGYGAGTG 250 

TATATACCAG 300 

AGAGGACGTA 350 

TGGAAAGAGG 400 

ACAGAAGCAC 450 

ATTATTAGAC 500 

GAGTACAAAG 550 

TCAGTAAAGG 600 
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CACACACAAA GTTTACAGCA GAAGTATATG 
GGAAGACATA CACCATTCTT TGATGGATAT 
AACAACAGAC GTAACAGGAG CTTGTAAGTT 
TAATGCCTGG AGATAACGTA ACAATGGAAG 



TACTTAAAAA AGAAGARGGT 650 

AGACCACAAT TCTATTTCAG 7 00 

AC C AG AAGGA ATAGAGATGG 7 50 

TAG AC TT AAT AAACTCAA 7 98 



2 ) INFORMATION FOR SEQ ID NO : 737 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Burkholderia pseudomallei 
20 (B) STRAIN: 1026B 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 737 

GTCAACATGA TGGAGCTCAT CAACAACATC GCGAAGGAGC ACGGCGGTTA 50 

25 CTCCGTGTTC GCGGGCGTGG GCGAGCGTAC CCGTGAAGGG AACGACTTCT 100 

AC C AC GAAAT GAAGGACTCG AACGTTCTCG ACAAGGTCGC GCTGGTGTAC 150 

GGCCAGATGA ACGAGCCGCC GGGCAACCGT CTGCGCGTGG CGCTGACGGG 2 00 

CCTCACGATG GCCGAGCACT TCCGTGACGA AGGCCTCGAC GTGCTGTTCT 250 

TCGTCGACAA CATCTACCGT TTCACGCTGG CCGGTACCGA AGTGTCGGCG 300 

3 0 CTGCTCGGCC GTATGC CGTC GGCAGTGGGC TATCAGCCGA CGCTGGCTGA 350 

AGAAATGGGC-AAGCTGCAAG "AGCGCATCAC GTCGACGAAG AAGGGC TCGA 400 

TCACGTCGGT T 41 1 



35 



45 



60 



2 ) INFORMATION FOR SEQ ID NO : 738 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 394 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium bifermentans 

(B) STRAIN: ATCC 63 8 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 738 

TACAAGAGCT TATTAACAAT ATAGCTACTC AAC AC GGTGG TATATCAGTA 50 

TTCGCAGGTG TTGGAGAGAG AACAAGAGAA GGTAACGACT TATTCCATGA 100 

GATGAGCGAT ACAGGAGTTA TAAATAAAAC AGCTCTAGTA TTCGGACAAA 150 

55 TGAATGAGCC ACCTGGAGCA AGAATGAGAG TTGCTTTAAC TGGTCTTACA 200 

ATGGCTGAAT ACTTCAGAGA TCAACAAGGG CAAGACGTTT TATTATTCGT 250 

AGATAATATA TTC CGTTTC A CTCAAGCAGG ATCTGAGGTT TCTGCACTTC 3 00 

TTGGACGTAC TCCATCAGCA GTTGGATACC AACCAACATT AGCAACAGAG 350 

ATGGGTAGAT TACAAGAGAG AATAACATCT ACAAATAAAG GGTC 394 
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2 ) INFORMATION FOR SEQ ID NO: 739 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 394 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium beijerincki (deposited as 
15 Clostridium butyricum) 

(B) STRAIN: ATCC 8260 



(xi) SEQUENCE DESCRIPTION: SEQ 

2 0 TTAATAAACA ACATAGCTAA ACAACATGGT 
AGTTGGTGAA AGATCAAGAG AAGGTAATGA 
AGTCAGGAGT TATTGATAAG ACAGCATTAG 
CCACCGGGTG CCAGAATGAG AGTTGC ATT A 
GTATTTTAGA GATAAAGGTC AAGATGTGTT 

25 TCAGATATAC TCAAGCAGGT TCAGAGGTTT 
CCTTCAGCGG TTGGATATCA GCCAACACTT 
TCAGGAAAGA ATTACATCAA CAGTTAATGG 



ID NO: 739 

GGTTTATCAG TATTTACTGG 50 

C TTATATC AT GAAATGAGAG 100 

TATTTGGACA AATGAATGAG 150 

ACAGGTCTTA CTATGGCAGA 200 

ACTATTCATA GATAACATAT 250 

CAGCATTACT TGGAAGAACA 300 

GCAACTGAAA TGGGTGCACT 3 50 

TTCTATTACG TCAG 394 



30 

2 ) INFORMATION FOR SEQ ID NO: -740 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 393 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

40 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: ATCC 9689 



45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 740 

TTATAAACAA TATTGC TAAG CAACATGGTG GTATTTCTGT ATTTTCAGGA 50 

GTAGGAGAAA GAACAAGAGA AGGTAACGAC CTTTATGGCG AAATGAGTGA 100 

GTCTGGAGTT ATAAATAAAA CAGCTCTAGT ATTTGGTCAA ATGAATGAAC 150 

50 CACCTGGAGC GAGAATGAGA GTTGCTTTAA CTGGACTTAC AATGGCAGAA 2 00 

CATTTTAGAG ATGAGCAAGG AC AAGAC GTT TT AC TTTTCG TTGATAATAT 2 50 

ATTCCGTTTC ACACAAGCTG GTTCAGAAGT TTCAGCACTT CTAGGACGTA 3 00 

TGCCATCAGC TGTTGG TT AT CAGCCAACAT TAGCTACTGA AATGGGTGCA 3 50 

CTTCAAGAGA GAATAACATC AACTAAGAAA GGTTCAATAA CAT 393 

55 



2 ) INFORMATION FOR SEQ ID NO: 741 

60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 398 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium ramosum 
10 (B) STRAIN: ATCC 25582 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 741 

TTGATTCAAG AATTCATTAA TAACATTGCT ACAGAACATG GTGGTTTATC 50 

15 AGTTTTTGCT GGAGTTGGTG AACGTAGCCG TGAAGGTAAT GATTTATATT 100 

ATGAAATGAA GGAAAGTGGT GTTTTATCTA AAACAACACT AGTATTTGGA 150 

CAGATGAATG AACCCCCAGG AGC TCGTTTA AGAGTTGCTT TAACGGGTCT 2 00 

TACTATGGCA GAAGAATTCC GTGATGAACA AGGTCAGGAT GTCTTATTAT 2 50 

TCATCGATAA TATTTTCCGT TTTACTCAAG CTGGATCTGA AGTATCTGCC 3 00 

20 TTAC TTGG AC GGGTACCATC ACAAGCTGGG TATCAGCCAA CTTTAGCAAC 350 

C G AAATGGGT GCTTTACAAG AAC GG ATT AC ATCAACTAAA AAAGGATC 398 



25 2 ) INFORMATION FOR SEQ ID NO : 742 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 380 bases 

(B) TYPE: Nucleic acid 

3 0 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium septicum 

(B) STRAIN: ATCC 12964 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 742 



40 



TAGCTAAGGA ACACGGTGGA CTTTCAGTAT TCACAGGTGT TGGAGAAAGA 50 

TCAAGAGAAG GTAATGATTT ATATTACGAA ATGAAAGAAT CAGGAGTTAT 100 

AGACAAGACA GCTCTAGTGT TTGGACAAAT GAATGAATCT CCAGGAGCTA 150 

GAATGAGAGT ATCTTTAACA GGATTAACTA TGGCTGAATA TTTCAGAGAT 2 00 

45 CAAGGTCAAG ATGTGCTTTT ATTCATAGAT AACATATTTA GATTTACTCA 2 50 

AGCTGGATCA GAAGTATCGG CTTTACTTGG AAGAATACCA TCAGCAGTTG 3 00 

GTTATCAACC AACACTAGCA ACTGAAATGG GTGCACTTCA AGAAAGAATT 3 50 

ACTTCAACTA AAAATGGATC AATAACTTCA 3 80 

50 

2 ) INFORMATION FOR SEQ ID NO: 743 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 3 89 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tertium 

(B) STRAIN: ATCC 14573 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 743 



TTAATAAATA 
AGTTGGAGAA 

10 AGTCAGGGGT 
TCACCAGGAG 
ATATTTTAGA 
TTAGATTTAC 
CCTTCAGCAG 

15 TCAAGAAAGA 



ATATAGCAAA 
AGGTCAAGAG 
TATAGATAAG 
CAAGAATGAG 
GATCAAGGTC 
TCAAGCGGGA 
TTGGATATCA 
ATAACATCAA 



AGAGCATGGT 
AAGGTAACGA 
ACAGCTTTAG 
AGTTTCATTA 
AAGACGTTCT 
TCAGAAGTTT 
ACCAACTCTT 
CAAAGAATGG 



GGTCTTTCTG 
C TTAT ATT AT 
TATTTGGACA 
ACTGGATTAA 
TTTATTTATA 
CTGCGTTATT 
GCAACTGAAA 
ATCAATCAC 



TATTTAC AGG 5 0 

GAAATGAAAG 100 

AATGAATGAA 150 

CTATGGCTGA 2 00 

GATAATATAT 25 0 

AGGAAGAATT 3 00 

TGGGAGC AC T 350 
389 



20 



2 ) INFORMATION FOR SEQ ID NO : 744 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
3 0 (A) ORGANISM: Comamonas acidovorans 

: ( B ) STRAIN : "ATCC -1^66 8 - 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 744 

3 5 TTCCCCCGCA CGCATGCCCA AGGTGTTCGA TGCCCTGAAG CTCGACGGCT 50 

CGGCCCTGAC GCTGGAAGTG CAGCAACTGC TGGGTGACGG CGTTGTGCGT 100 

ACCATCGCCC TGGGTTCGTC CGACGGTCTG CGTCGCGGCC TGATGGTGTC 150 

CAACACCGGC AACCCCATCA CCGTGCCCGT GGGCAAGGCG ACGCTGGGTC 2 00 

GCATCATGGA CGTGCTGGGC AATCCCATCG ACGAACGTGG TCCCGTGGAT 2 50 

40 CAGGCGCTGA CGGCTCCCAT CCACCGCAAG GCACCGGCTT ATGAC GAGCT 3 00 

GTCGCCTTCG CAGGAACTGC TGGAAACCGG CATCAAGGTG ATCGACCTGA 3 50 

TCTCGCCCTT CGCCAAGGGC GGCAAGGTGG GTC TGTTCGG TGGCGCCGGT 400 

GTGGGCAAGA CCGTGAACAT GATGGAACTC ATCAACAACA TCGCCAAGGG 450 

CCACGGTGGT CTGTCGGTGT TCGCCGGTGT GGGTGAACGT ACCCGCGAAG 500 

45 GCAATGACTT CTATCACGAA ATGTCGGACG CCGGCGTGGT CAACCAGGAG 550 

TCGCTGAACG ACTCCAAGGT GGCCATGGTC TACGGCCAGA TGAACGAACC 600 

CCCGGGCAAC CGTCTGCGCG TGGCGCTGAC CGGCCTGACC ATGGCCGAAG 650 

CCTTCCGTGA CGAAGGCAAG GACGTGCTGT TCTTCGTGGA CAACATCTAC 700 

CGCTACACGC TGGCCGGTAC CGAAGTGTCC GCTCTGCTGG GTCGCATGCC 750 

50 TTCCGCCGTG GGCTACCAGC CCACGCTGGC CGAGGAAATG GGCCGCCTGC 800 

AAGAGCGCAT CACCTCGACC AAGGTCGGTT CGATCACTTC CAC 843 



55 2 ) INFORMATION FOR SEQ ID NO : 745 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. 
rhinoscl eromatis 

(B) STRAIN: ATCC 13884 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 745 

GCCGTACCAC GCGTGTACGA AGCCCTTGAG GTACAGAATG GTAATGAAGT 50 

TCTGGTGCTG GAAGTTCAGC AGCAGCTGGG CGGCGGTATC GTACGTACCA 100 

TCGCCATGGG TTCTTCTGAT GGTCTGCGCC GCGGTC TGG A TGTAAAAGAC 150 

15 CTCGAGCACC C GATCG AAGT CCCGGTAGGT AAAGCAACGC TGGGTCGTAT 2 00 

CATGAACGTA CTGGGTCAAC CGGTTGACAT GAAAGGCGAC ATCGGCGAAG 2 50 

AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CGTCCTATGA AGAGCTGTCC 3 00 

AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 35 0 

TCCGTTCGCC AAGGGC GGTA AAGTTGGTCT GTTCGGCGGT G C GGGTGTAG 400 

20 GTAAAACTGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAGCGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAT AAAGTATCCC 550 

TGGTGTACGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGCC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

25 ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACTGAAG 700 

TATCCGCGCT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCGACC 7 50 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 800 

CGGTTCTATC ACCTCCGTA 819 

30 



45 



2 ) INFORMATION FOR SEQ ID NO: 746 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria canis 

(B) STRAIN: ATCC 14687 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 746 



GCGATTCTAT TCCGCGCGTA TATGATGCTC TTAAACTAGT GGATAGAGAA 50 

CTGACGCTTG AAGTACAACA ACAGTTGGGT GATGGTGTCG TTCGTAC TAT 100 

50 TGCGATGGGT AGTTCCGACG GCCTCAAACG AGGTTTGGCG GTAGTTAACA 150 

CCGGTGCTCC AATTACAGTG CCTGTGGGCA AAGCAACATT AGGC CGTATT 2 00 

ATGGACGTAT TAGGTAATCC GGTTGATGAA GCTGGTCCGA TTGGCTCCGA 2 50 

GCAAACCCGA GCAATCCACC AACCTGCTCC TAAGTTCGAC GAGCTTTCTA 3 00 

GCGCCACAGA GATTTTGGAA ACAGGTATTA AAGTAATTGA TTTGCTTTGC 3 50 

55 CCGTTTGCCA AAGGCGGTAA AGTAGGTTTG TTTGGTGGTG CGGGAGTGGG 400 

CAAAACCGTA AATATGATGG AGTTGATTAA CAACATCGCG AAAGCACACA 450 

GCGGTTTGTC TGTATTTGCC GGTGTGGGTG AACGGACGCG TGAAGGTAAT 500 

GACTTTTATC ATGAGATGAA AG ATTCC AAT GTATTAGATA AAGTAGCCAT 55 0 

GGTTTACGGT CAGATGAACG AGCCTCCCGG TAACCGTTTG CGCGTTGCGC 600 

60 TAACTGGCTT GTCTATGGCC GAATTCTTCC GTGACGAGAA AGATGAAAAC 650 

628 
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GGTAAAGGCC GTGATGTATT 
CCTAGCCGGT ACAGAAGTAT 
TAGGTTATCA GCCGACGTTG 
ATTACYTCAM CCCAAACAGG 



GTTCTTTGTA QACAATATTT 

CTGCATTGCT TGGCCGTATG 

GCAGAGGAAA TGGGCCGCTT 
CTCT 



ACCGCTATAC 7 00 

CCTTCGGCAG 7 50 

GCAAGAGCGT 800 
824 



2 ) INFORMATION FOR SEQ ID NO: 747 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria cinerea 
20 (B) STRAIN: ATCC 14685 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 747 

CGCGACGCTA TCCCGCATGT TTACGATGCC CTGAAATTGG ACGAGAACGG 5 0 

2 5 TCTGACTCTG GAGGTTCAAC AGCTTCTGGG CGACGGCGTT GTCCGTACTA 100 

TTGCAATGGG TAGTTCAGAC GGCCTTAAAC GCGGTATGTC TGTAAGCAAT 150 

ACTGGTGCGC CAATCACTGT GCCGGTAGGT AAAGGTACAT TGGGTCGTAT 2 00 

TGTCGACGTA TTGGGTACGC CTGTTGATGA AGCAGGTCCG ATCGATACCG 2 50 

ACAAAAGCCG TGCCATTCAC CAAACTGCTC C G AAATTCG A CGAGTTGTCT 3 00 

3 0 TCAGCTACCG AATTGTTGGA AACCGGTATT AAAGTGATCG ACTTGCTGTG 3 50 

TCCGTTTGCT AAAGGCGGTA AAGTAGGTCT GTTCGGTGGT GCCGGTGTGG 400 

GCAAAACCGT GAACATGATG GAATTGATCA ACAACATCGC CAAAGCGCAC 45 0 

AGCGGTCTGT CCGTGTTCGC AGGTGTGGGT GAGCGTACCC GTGAAGGTAA 5 00 

CGACTTCTAC CACGAGATGA AAGATTCCAA CGTATTGGAT AAAGTAGC C A 550 

3 5 TGGTGTATGG CCAAATGAAC GAACCTCCGG GCAACCGTCT GCGCGTTGCT 600 

TTGACCGGTT TGACTATGGC CGAATACTTC CGTG AC G AAA AAGACGAAAA 650 

CGGTAAAGGC CGCGACGTAT TGTTC TTCGT TGACAACATC TACCGTTACA 7 00 

CTTTGGCCGG TACTGAAGTA TCTGCACTGT TGGGCCGTAT GCCTTCTGCA 7 50 

GTGGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT TGCAAGAGCG 800 

4 0 TATTACCTCT ACCCAAACCG GTTCCATTAC T 831 



45 



2 ) INFORMATION FOR SEQ ID NO: 748 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 862 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 5 (A) ORGANISM: Neisseria cuniculi 

(B) STRAIN: ATCC 14688 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 748 

60 CCGTGGCCAA GT AC C AC AAA TTTATGACGC ACTGAGTGTT GATGGCACCG 50 

629 
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AAACAACCTT GGAAGTTCAA CAGCAGTTGG GTGATGGCGT GGTGCGT AC C 100 

ATTGCGATGG GTTCAACCGA AGGCTTGAAG CGTGGTTTGA CTGTATCTAA 150 

C TCTGGTGC A CCGATTTCTG TGCCAGTGGG TCAAGCGACT TTGGGTCGTA 2 00 

TTATGGATGT GTTGGGTCGT CCAATCGACG AGGCAGGTCC TGTAAATGCT 250 

5 CAAGAAAAAT GGTCAATTCA CCGTGAAGCA CCAAGCTATG ATGAGCAATC 3 00 

AAACTCAACT G AGC TGC TAG AAACAGGCAT CAAAGTAATT GATTTGCTTT 350 

GCCCATTTGC TAAAGGTGGT AAAGTTGGCT TGTTCGGTGG TGCAGGTGTG 4 00 

GGTAAAACCG TGAATATGAT GGAGCTGATT AATAATATCG CTCTGAAGCA 450 

TTCAGGTCTT TCTGTTTTTG CAGGTGTTGG TGAGCGTACT CGTGAGGGTA 5 00 

10 ACGATTTTTA TCACGAAATG CAAGAAGCAG GCGTTGTTAA TACCGAAGAC 550 

TTCACCAAGT CAAAAGTAGC GATGGTTTAT GGTCAGATGA ATGAGCCACC 6 00 

AGGAAACCGT TTGCGTGTTG CATTGACAGG CTTGACGATG GCAGAATATT 650 

TCCGTGACGA AAAAGATGAA GCAACAGGCA AAGGGC GTG A TGTTCTATTG 7 00 

TTCGTTGATA ACATCTATCG TTACACACTG GCTGGTACGG AAGTGTCAGC 7 50 

15 ACTTCTAGGT CGTATGCCAT CAGCAGTAGG TTATCAACCG ACTCTGGCTG 8 00 

AAGAAATGGG TGCGTTGCAA GAGCGTATTA CCTCAACGCA ATCGGGTTCG 8 50 

ATCACTTCGG GG 862 



20 



30 



55 



2 ) INFORMATION FOR SEQ ID NO : 749 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 844 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria elongata subsp. elongata 

(B) STRAIN: ATCC 25295 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO 



40 



45 



50 



ID NO: 749 






TTTTGATGCA 


TTAAAATTAG 


50 


AAC TTTTGGG 


GGATGGTGTA 


100 


GGATTAAAGC 


GTGGTATGGC 


150 


TCCTGTTGGC 


CGTGAAACTT 


200 


CGGTTGATGA 


GGCAGGTCCG 


250 


CAAGAGGCTC 


CTAAGTTTGA 


300 


AACTGGCATT 


AAGGTTATCG 


350 


AAGTAGGTCT 


GTTTGGTGGT 


400 


GAGTTAATTA 


ACAACATTGC 


450 


AGGCGTGGGT 


GAACGTACTC 


500 


AAGATTCCAA 


C GTATTGG AC 


550 


GAACCTCCAG 


GCAACCGTCT 


600 


C GAATAC TTC 


CGTGACGAAA 


650 


TGTTCTTCGT 


GGACAACATT 


700 


TCCGCATTGC 


TGGGTCGTAT 


750 


GGCTGAAGAA 


ATGGGTCGTT 


800 


GCTCTATTAC 


TTCC 


844 



2 ) INFORMATION FOR SEQ ID NO : 750 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 834 bases 

630 



CA 02307010 2000-05-19 



10 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria flavescens 

(B) STRAIN: ATCC 13120 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 750 



CGCGACGCTA TTCCGCATGT TTACGATGCC CTGAAATTGG ACGAGAACGG 50 

TCTGACTCTG GAAGTTCAAC AGCTTCTGGG TGACGGCGTT GTCCGTACTA 100 

15 TTGCAATGGG TAGTTCAGAC GGCCTGAAAC GCGGCATGTC TGTAAGCAAT 150 

ACCGGTGCGC CAATCACTGT GCCGGTAGGT AAAGGTACAT TGGGCCGTAT 2 00 

TGTCGACGTA TTGGGTACGC CTGTTGATGA AGCAGGTCCG ATCGATACCG -2 50 

ACAAGAGCCG TGCCATTCAC CAAACTGCTC C G AAATTCG A TGAGTTGTCT 3 00 

TCAACTACCG AATTGTTGGA AACCGGTATT AAAGTGATCG ACTTGCTGTG 3 50 

20 TCCGTTTGCT AAGGGCGGTA AAGTAGGTCT GTTCGGTGGT GCCGGTGTGG 400 

GCAAAACCGT GAACATGATG GAATTGATCA ACAACATCGC CAAAGCGCAC 450 

AGCGGCCTGT CCGTGTTCGC AGGTGTGGGT GAACGTACCC GTGAAGGTAA 500 

CGACTTCTAC CACGAGATGA AAGATTCCAA CGTATTGGAT AAAGTAGCCA 550 

TGGTGTATGG TCAAATGAAC GAACCTCCAG GCAACCGTCT GCGCGTTGCT 600 

25 TTGACCGGTT TGACTATGGC CGAATACTTC CGTGACGAAA AAGATGAAAA 650 

CGGTAAAGGT CGCGACGTAT TGTTCTTCGT TGACAACATC TACCGTTACA 7 00 

CTCTGGC CGG TACCGAAGTA TCCGC AC TGT TGGGTCGTAT GCCTTCTGCA 750 

GTGGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT TGCAGGAGCG 800 

TATTACTTCT ACGCAAACCG GTTCCATTAC TTCC 834 

30 



2 ) INFORMATION FOR SEQ ID NO: 751 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria gonorrhoeae 
45 (B) STRAIN: ATCC 31426 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 751 

CGCGACATGA TTCCGCGCGT TTACGACGCT TTGAAATTAG ACGAAAACGG 50 

50 TCTGACTTTG GAAGTCCAAC AGCTTTTGGG TGATGGCGTA GTCCGTACCA 100 

TCGCTATGGG CAGCTCGGAC GGTTTGAAAC GCGGCATGAC TGTGAGCAAT 150 

ACTGGTTCGC CCATTACTGT GCCGGTAGGT AAAGGTACGT TGGGACGCAT 200 

TGTCGATGTA TTGGGAACTC ' CTGTTGACGA GGCAGGTCCA ATTGATACCG 250 

ACAAGAGTCG TGCCATCCAC CAAGCCGCTC CTAAGTTTGA CGAACTGTCT 3 00 

55 TCCACAACCG AATTGCTCGA AACGGGCATT AAAGTGATTG ACTTGCTGTG 3 50 

TCCGTTTGCC AAAGGCGGTA AAGTAGGTCT GTTCGGCGGT GCCGGTGTGG 400 

GTAAAACCGT GAACATGATG GAATTGATCA ACAACATCGC CAAAGCGCAC 4 50 

AGCGGCTTGT CCGTGTTCTC AGGCGTGGGT GAGCGTACCC GCGAAGGTAA 5 00 

CGACTTCTAC CACGAGATGA AAGATTCCAA CGTATTGGAT AAAGTAGCCA 550 

60 TGGTGTATGG CCAAATGAAC GAACCTCCAG GCAACCGTCT GCGCGTTGCT 600 

631 
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TTGACCGGTT TGACTATGGC TGAATACTTC 
CGGTAAAGGT CGTGACGTAT TGTTCTTCGT 
CTCTGGCCGG TACCGAAGTA TCCGCACTGT 
GTGGGTT AC C AACCGACATT GGCTGAAGAA 
5 TATTACCTCT ACCCAAACCG GTTCCATTAC 



CGTGACGAAA AAGACGAAAA 650 

TGACAACATC TACCGTTACA 7 00 

TGGGCCGTAT GCCTTCTGCA 7 50 

ATGGGTC GTT TGCAAGAGCG 800 

TTCC 834 



2 ) INFORMATION FOR SEQ ID NO: 7 52 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Neisseria gonorrhoeae 

(B) STRAIN: ATCC 27628 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 752 

25 ACATGATTCC GCGCGTTTAC GACGCTTTGA AATTAGACGA AAACGGTCTG 50 

AC TTTGG AAG TCCAACAGCT TTTGGGTGAT GGCGTAGTCC GTACCATCGC 100 

TATGGGCAGC TCGGACGGTT TGAAACGCGG CATGACTGTG AGCAATACTG 150 

GTTCGCCCAT TACTGTGCCG GTAGGTAAAG GTACGTTGGG ACGCATTGTC 2 00 

GATGTATTGG GAACGCCTGT TGACGAGGCA GGTCCAATTG ATACCGACAA 250 

3 0 GAGTCGTGCC ATCCACCAAG CCGCTCCTAA GTTTGACGAA CTGTCTTCCA 3 00 

CAACCGAATT GCTCGAAACG GGCATTAAAG TGATTGACTT GCTGTGTCCG 350 

TTTGCCAAAG GCGGTAAAGT AGGTCTGTTC GGCGGTGCCG GTGTGGGTAA 400 

AACCGTGAAC ATGATGGAAT TGATCAACAA CATC GCC AAA GCGCACAGCG 450 

GCTTGTCCGT GTTCTCAGGC GTAGGTGAGC GTACCCGCGA AGGTAACGAC 500 

3 5 TTCTAC C AC G AGATGAAAGA TTCCAACGTA TTGGATAAAG TAGCCATGGT 550 

GTATGGCCAA ATGAACGAAC CTCCAGGCAA CCGTCTGCGC GTTGCTTTGA 600 

CCGGTTTGAC TATGGCTGAA TACTTCCGTG ACGAAAAAGA CGAAAACGGT 650 

AAAGGTCGTG ACGTATTGTT CTTCGTTGAC AACATCTACC GTTACACTCT 7 00 

GGCCGGTACC GAAGTATCCG CACTGTTGGG CCGTATGCCT TCTGCAGTGG 7 50 

40 GTTACCAACC GACATTGGCT GAAGAAATGG GTCGTTTGCA AGAGCGTATT 800 

ACCTCTACCC AAACCGGTTC CATTA 825 



45 2 ) INFORMATION FOR SEQ ID NO: 753 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria lactamica 

(B) STRAIN: ATCC 23970 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 53 

632 
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GATGCGATTC CGCATGTTTA CGATGCCCTG AAATTGGACG AG AACGGTC T 5 0 

GACCCTGGAA GTCCAACAGC TTTTGGGTGA CGGCGTTGTC CGTACTATTG 100 

CAATGGGTAG TTCAGACGGC CTGAAACGCG GCATGTCTGT CAGCAATACC 15 0 

GGTGCGCCAA TCACTGTGCC GGTAGGTAAA GGTACGTTGG GCCGTATTGT 2 00 

5 CGACGTATTG GGTACACCTG TTGACGAAGC AGGTCCGATC GATACCGACA 2 50 

AGAGCCGCGC CATCCACCAA ACCGCCCCGA AATTCGACGA GTTGTCTTCA 3 00 

ACTACCGAAT TGTTGGAAAC CGGCATTAAA GTGATCGATT TGCTGTGTCC 3 50 

GTTTGCTAAG GGCGGTAAAG TAGGTCTGTT CGGTGGTGCC GGTGTGGGCA 400 

AAACCGTGAA CATGATGGAA TTGATCAACA ACATCGCCAA AGCGCACAGC 450 

10 GGTCTGTCCG TGTTTGCAGG CGTGGGCGAG CGTACCCGCG AAGGTAACGA 5 00 

CTTCTACCAC GAGATGAAAG ATTCCAACGT ATTGGATAAA GTAGCCATGG 550 

TGTATGGTCA GATGAACGAA CCTCCGGGCA ACCGTCTGCG CGTTGCTTTG 600 

ACCGGTTTGA CGATGGCCGA ATACTTCCGC GACGAAAAAG ACGAAAACGG 650 

CAAAGGCCGC GACGTATTGT TCTTCGTGGA CAACATCTAC CGTTACACCC 7 00 

15 TGGCCGGTAC CGAAGTATCC GCACTGTTGG GCCGTATGCC TTCCGCAGTG 750 

GGTTACCAAC CGACATTGGC TGAAGAAATG GGTCGTTTGC AAGAGCGTAT 800 

TACCTCTACC CAAACCGGTT CCATTACTTC C 831 



20 



30 



55 



2 ) INFORMATION FOR SEQ ID NO: 754 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 836 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: 2241C 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 754 

CCACGCGACA TGATTCCGCG CGTTTACGAC GCTTTGAAAT TAGACGAAAA 50 

CGGTCTGACT TTGGAAGTCC AACAGCTTTT GGGCGACGGC GTAGTCCGTA 100 

CCATTGCGAT GGGCAGCTCG GACGGTTTGA AACGCGGCAT GACTGTGAGC 150 

40 AATACCGGTG CGCCCATTAC TGTGCCGGTA GGTAAAGGTA CGTTGGGACG 2 00 

CATTGTCGAT GTATTGGGAA CGCCTGTTGA CGAGGCAGGT CCAATCGATA 2 50 

CCGACAAGAG CCGTGCCATC CACCAAGCCG CTCCTAAGTT TGACGAACTG 3 00 

TCTTCCACAA CCGAATTGCT CGAAACGGGC ATTAAAGTGA TTGACTTGCT 3 50 

GTGTCCGTTT GCCAAAGGCG GTAAAGTAGG TCTGTTCGGC GGTGCCGGTG 400 

45 TGGGTAAAAC CGTGAACATG ATGGAATTGA TCAACAACAT CGCCAAAGCG 450 

CACAGCGGCT TGTCCGTGTT CGCAGGCGTG GGTGAGCGTA CCCGCGAAGG 5 00 

TAACGACTTC TACCACGAGA TGAAAGATTC CAACGTATTG GATAAAGTGG 550 

CAATGGTTTA CGGTCAGATG AACGAACCTC CGGGCAACCG TTTGCGCGTC 600 

GCATTGACCG GTTTGACCAT GGCGGAATAC TTCCGTGACG AAAAAGACGA 650 

50 AAACGGCAAA GGCCGCGACG TATTGTTCTT CGTGGACAAC ATCTACCGTT 7 00 

ACACTCTAGC TGGTACCGAA GTATCCGCAT TGTTGGGCCG TATGCCGTCT 7 50 

GCAGTGGGCT ACCAACCGAC ATTGGCAGAA GAAATGGGTC GTTTGCAGGA 800 

GCGTATTACC TCTACCCAAA CCGGTTCTAT TACTTC 83 6 



2 ) INFORMATION FOR SEQ ID NO : 755 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 837 bases 

633 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria mucosa 

(B) STRAIN: ATCC 19696 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 755 



CCGCGTGATG CCATTCCGCG TGTTTACGAC GCCCTGAAAT TGGATGCAAA 50 

CGGCCTGACT TTGGAAGTGC AACAGCTTCT GGGTGACGGC GTAGTTCGTA 100 

15 CTATTGCAAT GGGTAGTTCG GATGGTTTGA AACGCGGCAT GACTGTAAGC 150 

AATACAGGTG CGCCGATTAC AGTACCGGTA GGTAAAGGTA CTTTGGGACG 2 00 

TATTGTCGAT GTATTGGGTA CGCCTGTTGA CGAAGCAGGT CCGATTGATA 250 

CCGACAAACA CCGTGCCATC CATCAGACAG C TCC G AAATT TGATGAACTG 3 00 

TCTGCGACTA CTGAGCTGCT GGAAACCGGC ATTAAAGTGA TTGACTTGCT 3 50 

2 0 GTGTCCGTTT GCCAAAGGCG GTAAAGTAGG TCTGTTCGGT GGTGCCGGTG 400 

TAGGCAAAAC CGTCAACATG ATGGAATTAA TTAACAACAT CGCCAAAGCA 450 

CATAGCGGTT TGTCCGTGTT TGCAGGTGTG GGTGAGCGTA CCCGTGAAGG 500 

TAATGACTTC TACCACGAGA TGAAAGATTC CAACGTATTG GACAAAGTGG 550 

C G ATGGTTT A CGGTCAGATG AACGAACCTC CGGGTAACCG TCTGCGTGTA 600 

25 GCCTTGACCG GTTTGACGAT GGCCGAATAC TTCCGTGATG AAAAAG AC G A 650 

AAGCGGCAAA GGCCGTGACG TATTGTTCTT CGTGGACAAT ATCTACCGTT 7 00 

ACACCCTGGC CGGTACCGAA GT ATCC GC AT TGTTGGGTCG TATGCCTTCA 7 50 

GCAGTAGGTT ACCAACCGAC ATTGGCTGAA GAAATGGGTC GTTTGCAAGA 800 

GCGTATTACC TCTACCCAAA CAGGCTCCAT TACCTCC 837 

30 



2 ) INFORMATION FOR SEQ ID NO : 756 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria sub f lava 
45 (B) STRAIN: ATCC 14221 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 56 

CGCGACGCTA TCCCGCATGT TTACGATGCC CTGAAATTGG ACGAGAACGG 50 

5 0 TCTGACTCTG GAAGTTCAAC AGCTTCTGGG TGACGGCGTT GTCCGTACTA 100 

• TTGCAATGGG TAGTTCAGAC GGCCTGAAAC GCGGCATGTC TGTAAGCAAT 150 
ACTGGTGCGC CAATCACTGT GCCGGTAGGT AAAGGTACAT TGGGTCGTAT 200 

• TGTCGACGTA TTGGGTACGC CTGTTGATGA AGCAGGTCCG ATCGATACCG 250 
ACAAGAGCCG TGCCATTCAC CAAACTGCTC CGAAATTCGA CGAGTTGTCT 3 00 

55 TCAACTACCG AATTGCTGGA AACCGGTATT AAAGTGATCG AC TTGCTGTG 3 50 

TCCGTTTGCT AAGGGCGGTA AAGTAGGTCT GTTCGGTGGT GCCGGTGTGG 400 

GCAAAACCGT GAACATGATG GAATTGATCA ACAACATCGC CAAAGCGCAC 45 0 

AGCGGTC TGT CCGTGTTCGC AGGTGTGGGT GAACGTACCC GTGAAGGTAA 500 

CGACTTCTAC C AC G AG ATG A AAGATTCCAA CGTATTGGAT AAAGTAGC C A 550 

60 TGGTGTATGG CCAAATGAAC GAACCTCCGG GCAACCGTCT GCGCGTTGCT 600 

634 
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TTGACCGGTT TGACTATGGC 
CGGTAAAGGT CGCGACGTAT 
CTCTGGCCGG TACCGAAGTA 
GTGGGTTACC AACCGACATT 
5 TATTACCTCT AC C C AAACTG 



CGAATACTTC CGTGACGAAA 
TGTTCTTCGT TGACAACATC 
TClfeCACTGT TGGGCCGTAT 
GGCTGAAGAA ATGGGTCGTT 
GTTCCATTAC TTCC 



AAGACGAAAA 65 0 

TACCGTTACA 700 
GCCTTCTGCA 750 
TGCAAGAGCG 800 
834 



10 



2 ) INFORMATION FOR SEQ ID NO: 757 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
2 0 (A) ORGANISM: Neisseria weaver i 

(B) STRAIN: ATCC 51223 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 757 

2 5 CCCGTGATGC TATTCCCATG TATACGATGC CCTGAAATTG GTAGATAACG 50 

ATCTGACCCT GGAAGTGCAA CAACTTTTAG GTGATGGTGT GGTTCGT AC C 100 

ATTGCAATGG GTAGTTCAGA CGGCCTAAAA CGTGGTATGG CTGTTAACAA 15 0 

TACCGGCGCT CCGATTACTG TTCCGGTGGG GAAAGCCACC TTGGGACGTA 200 

TTATGGATGT GTTGGGTAAT CCGGTTGATG AAGCAGGTCC TGTTGTATCA 250 

30 GAAGAAACTC GCGCTATTCA TCAAGCTGCC CCTAAATTTG ACGAAC TGTC 300 

TTCAGCAACT GAGTTGTTGG AAACAGGCAT TAAAGTAATT GACTTGCTGT 350 

GCCCGTTTGC CAAAGGTGGT AAAGTAGGTT TGTTTGGTGG TGCCGGCGTG 400 

GGTAAAACCG TAAATATGAT GGAGTTGATC AACAACATCG CGAAGGCACA 450 

TAGTGGTTTG TCTGTATTCG CCGGTGTAGG TGAGCGTACC CGTGAAGGTA 500 

3 5 ACGACTTCTA CCATGAAATG AAAGACTCTA ACGTATTGGA TAAAGTAGCC 550 

ATGGTTTATG GCCAGATGAA TGAACCTCCG GGTAACCGTT TGCGCGTTGC 600 

TTTGACTGGT TTGACTATGG CCGAATATTT CCGTGACGAG AAAGATGAAA 650 

ACGGCAAAGG TCGTGACGTC TTGTTCTTTG TGGATAATAT C TATCGC TAT 700 

ACTCTGGCCG GTAC TGAAGT GTCTGCACTG TTAGGTCGTA TGCCGTCTGC 750 

40 AGTAGGTTAT CAGCCTACAT TGGCAGAAGA AATGGGTCGC TTGCAGGAGC 800 

GTATTAC TTC TACTCAAACA GGTTCGATTA CTT 83 3 



45 2 ) INFORMATION FOR SEQ ID NO: 758 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 
50 <C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria animalis 

(B) STRAIN: ATCC 19573 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 758 

635 
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CGCGATGCCA TTCCTCACGT TTACGATGCC CTGAAATTGG ACGACACCGG 50 

TCTGACTTTG GAAGTACAAC AACTTCTGGG C G AC GGTGTG GTACGTACCA 100 

TTGCAATGGG TAGTTCAGAC GGCCTGAAAC GGGGTTTGTC TGTGAGCAAT 150 

ACCGGTTCTC CGATTGCCGT TCCTGTCGGT AAAGCGACTT TGGGTCGTAT 200 

5 TATGG AC GTA TTGGGCAATC CGGTTGATGA AGCCGGTCCG GTTGCTACCG 250 

AAGAGAAACG TGCTATTCAC CAAGCCGCAC CGAAGTTTGA TGAATTGTCA 300 

TCAGCTACCG AGTTGTTGGA AACCGGTATT AAAGTAATCG ACTTGCTGTG 350 

TCCGTTTGCA AAAGGCGGTA AAGTAGGTCT GTTCGGCGGT GCCGGTGTGG 400 

GCAAAACCGT AAACATGATG GAATTGATCA ACAACATCGC CAAAGCACAC 450 

10 AGCGGTCTGT CTGTGTTTGC CGGTGTAGGT GAACGTACCC GCGAAGGTAA 500 

CGACTTCTAC CACGAGATGA AAGATTC C AA CGTGTTGGAT AAAGTAGCCA 550 

TGGTGTACGG TCAGATGAAT GAGCCGCCGG GCAACCGCTT GCGCGTGGCT 600 

TTGACCGGCC TGACTATGGC CGAATACTTC GTGACGAAAA AGACGAAAAC 650 

GGCAAAGGTC GTGACGTATT GTTCTTCGTG GACAACATTT ACCGCTACAC 700 

15 ACTGGCCGGT ACCGAAGTAT CAGCATTGTT GGGCCGTATG CCGTCTGCAG 750 

TAGGTTATCA GCCGACATTG GCAGAGGAAA TGGGTCGCTT GCAAGAGCGT 800 

ATTACCTCTA CCCAAACCGG TTCGATTACC TCT » 33 



20 

2 ) INFORMATION FOR SEQ ID NO : 759 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 819 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



55 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus penneri 

(B) STRAIN: ATCC 33519 



35 (xi) SEQUENCE DESCRIPTION: 



40 



45 



50 



AGGTTCAATC ACCTCTGTA 



ID NO: 759 






GTTATGAATG 


GTAAAGAAAA 


50 


CGGTGGTATC 


GTTCGTTGTA 


100 


GTGGCTTAAA 


AGTTGAAGAT 


150 


AAAGCAACAC 


TGGGACGTAT 


200 


GAAAGGTGAT 


ATTGCAACTG 


250 


CAACCTACGA 


AGAGTTATCA 


300 


AAAGTAATGG 


AC TT AATCTG 


350 


GTTCGGTGGT 


GCGGGTGTGG 


400 


GTAATATCGC 


GATCGAGCAC 


450 


GAGCGTACTC 


GTGAGGGTAA 


500 


CGTTCTTGAC 


AAAGTATCGT 


550 


GAAACCGTCT 


GCGTGTAGCA 


600 


CGTGATGAAG 


GCCGTGACGT 


650 


CACCTTAGCC 


GGTACAGAAG 


700 


CGGTAGGTTA 


CCAGCCAACA 


750 


CGTATCACTT 


CAACCAAAAC 


800 






819 



2 ) INFORMATION FOR SEQ ID NO: 7 60 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 819 bases 

636 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella, choleraesuls subsp. choleraesuis 
serotype Enteritidis 
10 (B) STRAIN: ATCC 13076 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 60 

GCCGTACCAC GCGTGTACGA TGCCCTTGAG GTGCAGAATG GTAATGAGAA 50 

15 GCTGGTGCTG GAAGTTCAGC AGCAGCTTGG CGGCGGTATC GTGCGTACCA 100 

TCGCGATGGG GTCTTCTGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAT 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCCACGC TGGGTCGTAT 200 

CATGAACGTC CTGGGCGAAC CGGTCGACAT GAAAGGCGAG ATCGGCGAAG 2 50 

AAGAGC GTTG GGCGATTCAC CGCGCAGCAC CTTCCTACGA AGAGTTGTCA 3 00 

2 0 AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 3 50 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GC GGGTGTAG 400 

GTAAAACCGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CAGTGTTTGC GGGCGTAGGG GAACGTAC TC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAT AAAGTATCCC 5 50 

25 TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 6 00 

CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 7 00 

TATCCGCACT GCTGGGCCGT ATGCCTTCCG CAGTAGGTTA CCAGCCGACT 7 50 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 8 00 

30 CGGTTCTATC ACCTCCGTA 819 



2 ) INFORMATION FOR SEQ ID NO : 761 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Yersinia pestis 

(B) STRAIN: KIM D27 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 761 

50 AC C AAAAGTG TACAACGCCC TTGAGGTAGA AGGTACAACT GAAAAGTTAG 50 

TGCTGGAAGT TCAGCAACAG TTGGGCGGTG GTGTTGTTCG TTGTATCGCA 100 

ATGGGCTCTT C CG ATGGTCT GAGCCGTGGG TTGAAAGTAA CCAACCTAGA 150 

ACACCCGATC GAAGTACCGG TTGGTAAAGC GACCCTTGGC CGTATCATGA 2 00 

ACGTATTGGG TGAACCAATC GACATGAAAG GTCCTATCGG TGAAGAAGAG 2 50 

55 CGTTGGGCAA TCCATCGCGA AGCGCCTTCT TATGAAGAGC TTGCTAGCTC 300 

ACAAGATCTG TTAGAAACCG GTATCAAGGT TATGGACCTG ATTTGTCCGT 35 0 

TTGC TAAGGG CGGTAAAGTC GGTCTGTTCG GTGGTGCGGG TGTAGGTAAA 400 

ACAGTAAACA TGATGGAGCT GATCCGTAAC ATCGCGATCG AGCACTCTGG 450 

GTATTCTGTA TTTGCCGGTG TAGGTGAGCG TACCCGTGAG GGTAATGACT 500 

60 TCTACCATGA AATGACTGAC TCCAACGTTT TGGACAAAGT ATCCTTGGTT 55 0 

637 
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TACGGCCAGA TGAATGAGCC ACCAGGTAAC 
CGGCCTGACC ATGGCGGAGA AATTCCGTGA 
TGTTTATCGA TAATATCTAT CGTTATACCC 
GCATTGCTGG GTCGTATGCC ATCAGCGGTA 
5 TGAAGAGATG GGTGTGTTGC AGGAACGTAT 
CAATCACCTC TG 



CGTCTACGCG TTGCACTGAC 600 
TGAAGGTCGT GACGTACTGC 650 
TAGCTGGTAC GGAAGTATCC 7 00 

GGTTATCAGC CAACACTGGC 750 
TACTTCCACT AAGACGGGTT 800 

812 



10 2 ) INFORMATION FOR SEQ ID NO: 762 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 408 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: . 

(A) ORGANISM: Burkholderia mallei 

(B) STRAIN: GB8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 762 

25 CGACGTGATC GAGCCCTTCT TCGTCGACGT GATGCGCTCT TGCAGCTTGC 50 

?caS?tcttc AGCCAGCGTC GGCTGATAGC CCACTGCCGA CGGCATACGG 100 

CCGAGCAGCG CCGACACTTC GGTACCGGCC AGCGTGAAAC GGTAGATGTT 150 

GTCGACGAAG AACAGCACGT CGAGGCCTTC GTCACGGAAG TGCTCGGCCA 2 00 

30 TCGTGAGGCC CGTCAGCGCC ACGCGCAGAC GGTTGCCCGG CGGCTCGTTC 250 

Itctggccgt ACACCAGCGC GACCTTGTCG AGAACGTTCG AGTCCTTCAT 300 

TTCGTGGTAG AAGTCGTTCC CTTCACGGGT ACGCTCGCCC ACGCCCGCGA 3 50 

ACACGGAGTA ACCGCCGTGC TCCTTCGCGA TGTTGTTGAT GAGCTCCATC 400 
ATGTTGAC 

35 



2 ) INFORMATION FOR SEQ ID NO: 763 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNE S S : Doubl e 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



45 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium sordelln 
50 (B) STRAIN : ATCC 9714 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 763 

GAACTTATAA ACAACATAGC TACTCAACAT GGTGGTATAT CAGTATTCGC 50 

55 AGGTGTTGGA G AG AG AAC AA GAGAAGGTAA CGACCTTTAC GGAGAAATGA 100 

GTGAGTCTGG AGTTATAAAC AAGACAGCTC TAGTATTCGG ACAAATGAAT 150 

GAGCCACCTG GAGCAAGAAT GAGAGTTGCT TTAACTGGTC TTACAATGGC 200 

TGAATATTTC AGAGATCAAG AAGGACAAGA CGTTTTATTA TTCGTAGATA 250 

ATATATTCCG TTTCACTCAA GCAGGATCTG AGGTTTCTGC ACTTCTTGGA 3 00 

60 C GT ACTC CAT CAGCAGTTGG ATACCAACCA ACATTAGCTA CAGAGATGGG 350 

638 
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TAGATTACAA GAGAGAATAA CATC TAC AAA TAAAGGGTCT ATAACATCAG 400 



5 2 ) INFORMATION FOR SEQ ID NO: 7 64 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium novyi 

(B) STRAIN: ATCC 19402 



20 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 64 



TTAATTCAAG AATTAATCAA CAATATAGCG AAGGAACACG GTGGATTATC 50 

TGTATTTACA GGAGTTGGAG AAAGAACAAG AGAAGGTAAT GACCTTTACT 100 

ATGAAATGAA AGAATCTGGT GTTATAAATA AAACAGCACT AGTATTTGGT 150 

CAAATGAATG AGCCACCTGG AGCAAGAATG AGAGTTGCTC TTACAGGACT 200 

25 TACTATGGCA GAATATTTCA GGGATCAAGG ACAAAACGTA CTTTTATTCA 250 

TAGACAACAT ATTTAGATTC ACTCAAGCAG GTTCAGAGGT GTCAGCTTTA 300 

CTTGGAAGAA TACCTAGTGC C GTTGG AT AC CAACCAACAC TAGCAACAGA 35 0 

AATGGGTGCG CTTCAAGAAA GAATAACATC TACAAAGCAT GGTTC TATAA 400 

CATC A 405 

30 



2 ) INFORMATION FOR SEQ ID NO: 765 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 393 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium botulinum 
45 (B) STRAIN: 20:3.1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 65 

AATTAATAAA TAACATAGCT AAAGAACATG GTGGATTATC AGTGTTTACT 50 

50 GGAGTTGGAG AAAG ATC TAG AGAAGGTAAT GATCTTTATC ATGAAATGAG . 100 

AGAATCAGGC GTTATAGATA AAACCGCATT AGTATTTGGT CAAATGAATG 150 

AGCCACCTGG TGCTAGAATG AGAGTTGCAT TAACAGGGTT AACTATGGCT 2 00 

GAATATTTTA GAGATAAAGG TCAAGATGTA TTACTATTTA TAGATAATAT 2 50 

ATTTAGATTT ACTCAAGCTG GATCAGAAGT TTCAGCATTA CTTGGAAGAA 300 

55 TACCATCAGC AGTTGGTTAC CAACCAACTT TAGCAACTGA AATGGGTGCA 3 50 

CTTCAAGAGA GAATTACATC AACTAAGAAT GGTTCTATAA CAT 393 



60 2 ) INFORMATION FOR SEQ ID NO : 7 66 

639 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 399 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: , 

( A ) ORGANISM: Clostridium histolyticum 

(B) STRAIN: ATCC 19401 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 766 

15 AGGAACTTAT AAATAATATT GCAAAACAAT ATGGAGGTCT ATCTGTATTT 50 

ACAGGTGTTG GTGAAAGAAC AAGAGAAGGT AATG AC CT AT ATAATGAAAT 100 

SSmmtct GGGGTTATAG AGAAAACTGC ACTAGTATTT GGACAGATGA 150 

ATGAACC AC C AGGAGCGAGA ATGAGAGTAG CATTGACAGG ACTTACTATG 200 

20 SagJatatt t?agagatca AGGGCAAGAT GTACTTTTAT TTATAGATAA 250 

20 TATATTTAGA TTTACGCAGG CTGGTTCTGA AGTTTCTGCA TTGTTAGGAA 3 00 

GAATTCCAAG TGCAGTTGGA TATCAACCAA CCCTTGCAAC TGAAATGGGT 3 50 

gS?Scaag AAAGAATAAC ATCCACAAAA AATGGATCAA TTACTTCAG 399 



25 



40 



2 ) INFORMATION FOR SEQ ID NO: 7 67 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 390 bases 

— (B) — TYPE - : Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

3 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus prevotn 

(B) STRAIN: ATCC 9321 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 67 

CATGATGGAA CTGATCCGTA AC ATCGC CAT CGAGCACAGC GGTTATTCCG 50 

TGTTCGCCGG TGTGGGTGAG CGTACTCGTG AGGGTAACGA CTTCTACCAC 100 

45 GAGATGAAGG ACTCCAACGT TCTGGACAAA GTGGCACTGG TCTACGGTCA 150 

GATGAACGAG CCGCCGGGTA ACCGTCTGCG CGTAGCACTG ACTGGCCTGA 200 

CCATGGCCGA G AAGTTC CGT GACGAAGGTA ACGACGTTCT GTTGTTCGTC 2 50 

GACAACATCT ACCGTTACAC CCTGGCCGGT ACTGAAGTAT CCGCACTGCT 3 00 

GGGCCGTATG CCTTCGGCAG T AGGTTAC C A GCCGACCCTG GCTGAAGAGA 3 50 

50 TGGGCGTTCT GCAAGAACGT ATCACTTCGA CCAAGGAAGG 3 90 



2 ) INFORMATION FOR SEQ ID NO: 7 68 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 953 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 

640 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Absidia corymbifera 

(B) STRAIN: ATCC 46775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 68 

10 AGGTC TTGTT CGTGGCAAAA GGTCATTGAC ACTGGTGCTC CTATCACCAT 50 

TCCTGTTGGT AACGAAGTCC TTGGTCGTAT CATTAACGTC ATTGGTGAGC 100 

CCATTGATGA GCGTGGTCCY ATCAAGTCCA AGGCCACTCG TGCTATCCAC 150 

GCTGATGCTC CCGAGTTCGT TGATCAATCC CCCACTCCCG AGATTCTCGA 200 

GACTGGTATC AAGGTTGTCG ATTTGCTTGC TCCCTATGCT CGTGGTGGTA 250 

15 AGATTGGTCT TTTCGGTGGT GCCGGTGTCG GCAAGACTGT CTTGATTCAA 300 

GAATTGATCA ACAACATTGC CAAGGCTCAC GGTGGTT AC T CTATCTTCTG 350 

TGGTGTC GGT GAACGTACTC GTGAAGGCAA CGATTTGTAC CACGAAATGA 400 

TTTCCACTGG TGTCATCAAG CTTGAAGGTG AATCCAAGTG TGCTCTTGTC 450 

TTTGGTCAAA TGAAC GAACC CCCCGGAGCT CGTGCCCGTG TTGC CTTG AC 500 

2 0 TGGTTTGACC ATTGCCGAAT ACTTCCGTGA TGAGGAAGGT CAAGATGTGT 550 

TGCTCTTCAT TGACAACATT TTCCGTTTCA CTCAAGCCGG TTCTGAAGTG 600 

TCCGCTTTGC TTGGTCGTAT TCCCTCTGCT GTCGGTTACC AACCCACTCT 650 

YTCCACTGAT ATGGGTGGTA TGCAAGAGCG TATTACTACC ACCAAGAACG 700 

GTTC C ATT AC CTCCGTGCAA GCTGTCTACG TCCCTGCTGA CGATTTGACT 750 

25 GATCCTGCTC CTGCTACTAC TTTTGCTCAC TTGGACGCCA C C ACTGTGTT 800 

GTCTCGTTCC ATTGCTGAGT TGGGTATCTA CCCTGCTGTC GATCCCCTTG 850 

ACTCCAAGTC CCGTATCTTG GATCCCCGTA TCGTTGGTGA TGAGCACTAC 900 

TCTGTTGCCA CTGGTGTCCA ACAAATCCTT CAAAACTACA AGTCGCTTCA 950 

AGA 953 

30 

2 ) INFORMATION FOR SEQ ID NO: 769 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1343 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alternaria alternata 
45 (B) STRAIN: ATCC 62099 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 769 

CGCGGTTCCA AGGCCACCGA CACCGGTGCC CCCATCAAGA TTCCCGTTGG 50 

50 TCACGGTACC CTTGGTCGTA TCATGAACGT CACTGGTGAC CCCATTGACG 100 

AGCGTGGTCC CATCAAGGCC ACCAAGTACG CTCCCATCCA CGCCGACCCC 150 

CCGGAGTTCA CCGAGCAATC CACCTCCGCT GAGGTCCTCG TTACCGGTAT 200 

CAAGGTTGTC GACCTGTTGG CTCCTTACGC TCGTGGTGGA AAGATTGGTC 250 

TCTTCGGAGG TGCTGGTGTC GGAAAGACTG TCTTCATTCA GGAGCTGATT 300 

55 GTAAGGAGAC ACACTGTCTA CTGGCTGAGC ATTAGCTAAC GGCAGGCAGA 350 

ACAACATCGC CAAGGCCCAC GGTGGTTTCT CTGTCTTCAC TGGTGTCGGT 400 

GAGCGTACCC GTGAGGGTAA CGATCTGTAC C AC GAGATGC AGGAGACTTC 450 

CGTCATTCAG CTTGACGGTG ACTCCAAGGT CGCCCTCGTC TTCGGTCAGA 500 

TGAACGAGCC CCCGGGTGCC CGTGCCCGTG TCGCTCTTAC TGGTCTTACT 550 

60 GTTGCTGAGT AAGTCTTGAA TTCACTGTGT TGACAACGTC GTGGCTAATG 600 

641 
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fPAAAAAAGA TACTTCCGTG ACGAAGAGGG ACAGGATGTG CTTCTCTTCA 65 0 

GTGCTTGTAC GAACTGCCTG TGAGACATAC ACTGACTTCG 700 

GCAATAGACA TTTTCCGTTT CACCCAGGCC GGTTCCGAGG TGTCCGCTCT 750 

?S?GOTCGT ATCCCCTCTG CCGTCGGTTA CCAGCCCACC CTCGCCATTG 800 

* ATATGGGTGT CATGCAGGAG CGTATTACCA CCACCACCAA GGGTTCCATC 850 

AOTC?GTC? AGGCCGTCTA CGTGCCCGCT GACGATTTGA CTGACCCTGC 900 

SScCGCCACC ACCTTCGCCC ATTTGGACGC CACCACTGTC TTGTCCCGTG 950 

gS5ctccga gttgggtatc TACCCCGCCG tcgaccctct TGACTCCAAG 1000 

TC?CG?ATGT TGGACCCCCG TGTCATTGGT CAGGACCACT ACGACACCGC 1050 

in S?C?GCG?? CAGCAGATTC TCCAGGAGTA CAAGTCGCTC CAGGATATCA 1100 

TtSSttS CGGTATGGAC GAGTTGTCGG AAGCTGACAA GCTTACCGTC 1150 

GAGCGTGCCC GTAAGATCCA GCGTTTCTTG AGCCAGCCTT TCGCTGTCGC 1200 

rr AGGTCTTC ACTGGTATTG AGGGCAAGCT TGTCGACCTC AAGGACACCA 1250 

TCCGATCaS CAAG^C?ATC TTGACTGGTG AGGGTGACGA CCTTCCCGAG 1300 

15 GGTGAGTCTC GACTATCTCC GCATTCATAG CGTATAACTG ACA 1343 



2 ) INFORMATION FOR SEQ ID NO: 770 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Aspergillus flavois 

(B) STRAIN: ATCC 26947 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 770 

-i c TTCAGGAGTT GATTGTATGT TCACCTGCAA CATAAGACTT CCCATTCTCC 50 

35 a?TCT??TCT AACTCTTCAC AGAACAACAT TGCCAAGGCT CACGGTGGTT 100 

ACTCTGTCTT CACTGGTGTC GGTGAGCGTA CCCGTGAGGG TAACGATCTG 150 

TACCACGAAA TGCAGGAGAC TGGTGTCATT CAGCTCGAGG GTGAATCTAA 200 

StcGCCOT OTCTTCGGTC AGATGAACGA GCCCCCAGGT GCCCGTGCCC 250 

40 GTGTCGCCCT TACCGGTCTG ACCATCGCCG AGTACTTCCG TGACGAGGAA 300 

GGTCAGGATG TGCTGCTCTT CATTGACAAC ATTTTCCGTT TCACCCAGGC 3 50 

CGGTTCTGAG GTGTCTGCCC TTCTTGGTCG TATCCCCTCC GCTGTCGGTT 400 

ACCAGCCCAC TCTGGCCGTC GACATGGGTG GTATGCAGGA GCGTATTACC 450 

ACCACCACCA AGGGTTCCAT TACCTCCGTC 480 

45 



2 ) INFORMATION FOR SEQ ID NO: 771 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1174 bases 

(B) TYPE: Nucleic acid 

( c ) STRAND EDNES S : Doub 1 e 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mucor circinelloides 
SO (B) STRAIN: ATCC 38592 

642 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 771 



CTCGAACAAG 
5 YCACTCTGGT 
ACACTGTCCG 
TCCCANNGGA 
GATGATTGAC 
CACTGGTGCT 

10 TCATCAACGT 
AAGACTCACC 
CCCCACTCCC 
CTC CTTATGC 
GGTAAG AC TG 

15 CGGTGGTTAC 
ACGATTTGTA 
GAGTCCAAGT 
TCGTGCCCGT 
ATGATGAGGG 

2 0 ACTCAAGCTG 
TGTCGGTTAC 
GTATTACTAC 
GTCCCTGCTG 
CTTGGATGCC 

2 5 AYCCCGCTGT 
ATTGTCGGTG 
C C AAAAC T AC 



AY AACTTGC C 
GGACGTCTCG 
TACTATTGCT 
TACAGTCARA 
CAATATGTTA 
CCCATTACCA 
TATTGGTGAA 
GTC CTATTC A 
GAAATCCTCG 
TCGTGGTGGT 
TCTTGATTCA 
TCTATCTTCT 
CCATGAAATG 
GTGCTCTCGT 
GTCGCTTTGA 
TCAAGATGTC 
GTTCTGAGGT 
CAACCCACTC 
CACCAAGAAC 
ATGATTTGAC 
ACCACTGTCT 
CGATCCTCTT 
ATGAGC AC T A 
AAGTCTCTCC 



YGCCATTTTG 
TYCTCGAAGT 
ATGGATGGTA 
CAGMAATGTC 
GGTCTTGTCC 
TCCCCGTTGG 
CCCATTGATG 
CGCTGAAGCT 
AGACTGGTAT 
AAGATTGGTC 
AGAATTGATT 
GTGGTGTCGG 
ATTGAAACCG 
CTTTGGTCAA 
CTGGTTTGAC 
TTGCTTTTCA 
ATCTGCCCTT 
TTTCCACYGA 
GGTTCCATTA 
CGATCCTGCT 
TGTCTCGTTC 
GATTCCAAGT 
CAAGGTTGCC 
AAGA 



AACGCCCTTG 
CTCTCAACAT 
CTGAAGGTAA 
TAGTGGTTAT 
GGTGGTCAAA 
TAAGGAAGTC 
AACGTGGTCC 
CCCGAATTCG 
CAAGGTYGTC 
TCTTCGGTGG 
AACAACATYG 
TGAACGTACT 
GTGTCATTCA 
ATGAACGAAC 
TATTGCTGAA 
TTGATAACAT 
TTGGGTCGTA 
TATGGGTGGT 
CCTCCGTCCA 
CCTGCCACCA 
CATCGCTGAA 
CTCGTATCCT 
ACTGAAGTTC 



AAGTC AAGG A 5 0 

TTGGGTGAGA 100 

GTTATGTYCA 150 

AGCAGYAGCA 200 

AGGTTGTTGA 250 

CTTGGTCGTA 300 

CATTGACGCC 350 

TTGATCAATC 400 

GATTTGTTGG 45 0 

TGCTGGTGTC 500 

CCAAGGCTCA 55 0 

CGTGAGGGTA 600 

ATTGGAAGGC 65 0 

CCCCAGGTGC 7 00 

TACTTCCGTG 750 

TTTCAGATTC 800 

TTCCTTCCGC 850 

ATGCAAGAGC 900 

AGCTGTCTAC 950 

CTTTTGCTCA 1000 

TTGGGTATCT 1050 

CGATCCCCGT 1100 

AACAAATCCT 1150 
1174 



30 



40 



2 ) INFORMATION FOR SEQ ID NO : 772 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 467 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Piedraia hortai 

(B) STRAIN: ATCC 24292 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 772 

AGGAGCTTAT CAACAACATC GCCAAGGCTC ACGGTGGTTA CTCCGTCTTC 50 

ACTGGTGTCG GTGAGCGTAC TCGTGAGGGT AACGATTTGT ACCACGAGAT 100 

GCAAGAGACT TCCGTCATTC AGCTTGACSG CGAGTCCAAG GTCGCTCTCG 150 

5 0 TGTTCGGTCA GATGAACGAG CCCCCGGGTG CCCGTGCCCG TGTTGCCCTG 200 

ACTGGTCTTA CCATCGCTGA GTACTTCCGT GATGCCGAGG GTCAGGATGG 250 

T AAGTTC TAT AACTCTTGTC GCAAAGGTTT CATTCTGGTC GCT AACTTGC 3 00 

TCAGTGCTCC TGTTCATCGA CAACATTTTC CGTTTCACCC AGGC TGGTAT 3 50 

GGAGGTGTCT GCCCTCCTCG GTCGTATTCC TTCTGCCGTC GGTTACCAAC 400 

55 CTACTCTCGC CGTCGACATG GGTGGTATGC AAGAGCGTAT TACCACTACC 450 

AAGAAGGGAT CCATTAC 467 



60 2 ) INFORMATION FOR SEQ ID NO: 773 

643 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 8 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

in (vi) ORIGINAL SOURCE: . 

( A ) ORGANISM: Pseudalleschena boydxi 

(B) STRAIN: ATCC 44331 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 773 

15 TTPCCAAGGC CCACGGTGGT TACTCTGTCT TCACTGGTGT CGGTGAGCGT 50 

acc^Stcagg GTAACGATCT gtaccacgaa ATGCAGGAAA CCTCCGTCAT 100 

TCAGCTTGAT GGCGAGTCCA AGGTCGCGCT TGTCTTCGGT CAGATGAACG 150 

AGCCCCCTGG TGCCCGTGCT CGTGTCGCTC TTACTGGTCT TACCGTTGCC 200 

20 gIStacttcc GTGATGAGGA GGGTCAGGAT GGTAAGTTAT ATCGTTTTTA 250 

TTATCTTCTT TGCCACCACC CCTCTACGAA TCCATGCCTC CGTTGGTGAA 300 

GGCATCGTTT GTAGGGCGGG TCGGAGTTTG CGGCAATTTC TGCCGTC GGC 350 

OTG^GCCGC GGATGCCCGA TGTTTGACGC GTATCGATGC TAACAACAAT 400 

ga2aa?agtg CTTCTCTTCA TTGACAACAT TTTCCGATTC ACCCAGGCCG 450 

25 GTTCCGAGGT SrCTGCCCTT CTCGGTCGTA TTCCCTCTGC CGTCGGTTAC 500 

CAGCCCACTC TTGCCGTAGA CATGGGTGCC ATGCAGGAAC GTATTACCAC 550 



CACCAAGAAG GGTTCGATTA CCTCCGTC 



578 



30 

2 ) INFORMATION FOR SEQ ID NO: 774 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1123 bases 
35 (b) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhizopus oryzae 

(B) STRAIN: ATCC 56015 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 774 

AACTTACCYG CTATCTTGAA CGCTCTCGAA GTCCAAGATC ACTCTGGTGG 50 

ACGTCTTGTC CTTGAAGTTC GCTCAACACT TGGGTGAAAA TACTGTCCGT 100 

ACTATTGCTA TGGATGGTAC TGAAGGTAAG CTATACTATA ACCGTKTTAT 150 

50 CCGAGTATGA TATTAACTTG AAAAAAGGTC TCGTYCGTGG TCAAAAGGTT 200 

ATTGACACTG GTGCTCCCAT TACCATTCCT GYTGGTAAGG AAGTTCTCGG 250 

TCGTATCATT AACGTCATTG GTGAACCCAT CGATGAACGT GGTCCTATCA 300 

ACGCCAAGAG CCAACGTCCC ATTCACGCCG AAGCTCCCGA ATTCGTTGAC 350 

CAATCTCCTA CTCCCGAAAT TCTTGAAACT GGTATCAAGG TTGTCGACTT 400 

55 GTTGGCTCCT TATGCTCGTG GTGGTAAGAT TGGTCTTTTC GGTGGTGCTG 450 

GTGTCGGTAA GACTGTGTTG ATTCAAGAAT TGATTAACAA CATCGCCAAG 500 

GCTCACGGTG GTTACTCTAT TTTCTGTGGT GTCGGTGAAC GTACTCGTGA 550 

AGGTAACGAT CTTTACCACG AAATGATTGA AACTGGTGTC ATCAAGCTCG 600 

ATGGTGACTC CAAGTGTGCT C TTGTCTTTG GTCAAATGAA CGAACCCCCA 650 

60 GGAGCTCGTG CC CGTGTTGC CTTGACTGGT TTGAC C ATTG CTGAATACTT 700 

644 
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CCGTGATGAT GAAGGTCAAG ATGTGTTGCT 
GTTTCACCCA AGCTGGTTCW GAAGTATCTG 
TCCGCTGTCG GTTACCAACC CACTCTTTCT 
AG AAC GT ATT ACAACCACCA AGAACGGTTC 
5 TCTACGTCCC TGCTGATGAT TTGACCGATC 
GCTCACTTGG ATGCCACCAC TGTCTTGTCT 
TATTTACCCT GCCGTCGATC CTCTTGAYTC 
CTCGTATCGT TGGTGACGAA CACTACAAGG 
ATCC TTC AAA AC TAC AAGTC TCT 



TTTCATTGAT AACATTTTCC 7 50 

CCCTTTTGGG TCGTATTCCC 800 

ACTGATATGG GTGGTATGCA 850 

CATTACATCC GTCCAAGCTG 900 

CTGCTCCCGC CACCACTTTT 950 

CGTTCCATTG CCGAATTGGG 1000 

CAAGTCTCGT ATCTTGGATC 1050 

TCGCTACCGA AGTTCAACAA 1100 

1123 



2 ) INFORMATION FOR SEQ ID NO : 775 

15 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 477 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



40 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Scopulariopsis koningii 
25 (B) STRAIN: ATCC 38745 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 77 5 

ATTCAGGAGC TCATCAACAA CATTGCCAAG GCTCACGGTG GTTACTCTGT 50 

3 0 GTTCACTGGT GTCGGTGAGC GTACCCGTGA GGGTAACGAT CTGTACCACG 100 

AAATGCAGGA GACTTCGGTC ATTCAGCTCG AGGGCGAGTC CAAGGTCGCG 150 

CTTGTGTTCG GTCAGATGAA CGAGCCCCCC GGTGCCCGTG CCCGTGTCGC 2 00 

CCTTACCGGT CTGAC C GTTG CCGAGTACTT CCGTGACGAG GAGGGCCAGG 25 0 

ATGGTGAGTA ACCGAC GAAG TCTGAGATCT TGTCGGGCAT TATTCTAACG 3 00 

35 AC AAC TAGTG CTTCTCTTCA TC G AC AAC AT TTTCCGCTTC ACCCAGGCCG 3 50 

GTTCCGAGGT GTCCGCGCTT CTCGGCCGTA TCCCCTCTGC CGTCGGTTAC 400 

CAGCCCACCC TGGCCGTCGA CATGGGAGGT ATGCAGGAGC GTATTACCAC 450 

GACTCAGAAG GGCTCGATTA CCTCGGT 477 



2 ) INFORMATION FOR SEQ ID NO: 77 6 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 610 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: ATCC 8125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 776 



TCCGGAGTTG ATTGTAAGTC ATTTGAAACC CAGCCCCAAG AAACAGAAGC 50 

TAGGCGAAAA TTGGACAATT GAGCAATTTA GCCATTGGAG AAAAGAAATT 100 

60 TCGAGTATTA ATTGTTTTTA TAGAACAACA TTGCCAAGGC TCACGGTGGT 150 

645 
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TACTCTGTCT TCACTGGTGT 
CTACCATGAA ATGCAGGAGA 
AGGTCGCCCT GGTCTTCGGC 
CGTGTTGCTC TTACTGGTTT 
5 AGGTCAAGAC GGTGAGTTTY 
TTTTTTMAAR AAATTC ATGT 
CATCGACAAC ATTTTCCGTT 
TGCTTGGTCG TATCCCATCT 
GACATGGGTG GTATGCAGGA 
10 TACCTCCGTC 



CGGAGAGCGT ACCCGTGAAG 
CCCGTGTCAT TCAGCTTGAT 
CAGATGAACG AGCCCCCAGG 
GACCATTGCT GAGTACTTCC 
TTATGGATAA AARATTTTTT 
TCTAACAAAG TATATCCTAG 
TCACTCAGGC TGGTTCCGAA 
GCCGTCGGTT ACCAACCCAC 
ACGTATTACC ACCACCAAGA 



GAAACGATCT 2 00 

GGCGAGTCCA 250 

TGCCCGTGCC 3 00 

GTGATGAGGA 350 

TTTTTTTTTT 400 

TGCTTCTCTT 450 

GTGTCTGCCC 500 

TCTTGCCGTC 550 

AGGGATCCAT 600 
610 



15 



20 



2 ) INFORMATION FOR SEQ ID NO: 777 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 bases 

(B) TYPE: Nucleic acid 

( c ) STRANDEDNESS : Doubl e 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



593 



(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Trichophyton tonsurans 

(B) STRAIN: ATCC 56185 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 777 

™ GGAGTTGATT GTAA&TCATT TGAAACCCAG CCCCAAGAAA CAGAAGCTAG 50 

-g?g^?tc-Sacaattgaa CAATTTAGCC CTTGGAGAAA agaaatttcg 100 

?g?SSJtt atttttatag aacaacattg ccaaggctca cggtggttac 150 

TCTGTCTTCA CTGGTGTCGG AG AGCGTAC C CGTGAAGGAA AC G ATCTC T A 200 

ccISgaaatg caggagaccc GTGTCATTCA GCTTGATGGC GAGTCCAAGG 250 

35 ScGCC^G? CTTCGGCCAG ATGAACGAGC CCCCAGGTGC CCGTGCCCGT 300 

GTTGCTCTTA CTGGTTTGAC CATTGCTGAG TACTTCCGTG ATGAGGAAGG 350 

SJSrATGCT GAGTTTCTTA TGGATGAAAG ATTTTTTTTT TTCAAGAAAT 400 

tc^gSS acSIg?Sta Sctagtgct TCTCTTCATC GACAACATTT 450 

TCCGTTTCAC TCAGGCTGGT TCCGAAGTGT CTGCCCTGCT TGGTCGTATC 500 

40 SaSgCCG TCGGTTACCA ACCCACTCTT GCCGTCGACA TGGGTGGTAT 550 
GCAGGAACGT ATTACCACCA CCAAGAAGGG ATCCATTACC TCC 



45 2 ) INFORMATION FOR SEQ ID NO: 77 8 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1141 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichosporon cutaneum 

(B) STRAIN: ATCC 62965 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 77 8 

60 

646 
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CCGTGGTCAA GAAGTTATTG ACACTGGTGC CCCAATTACC ATTCCTGTTG 50 

GTCGTGGTAC TC TTGGTAG A ATTATCAACG TCATTGGTGA ACCAATTGAC 100 

GAACGTGGCC CTATCAAGGC TTCTAAGTAT GCTCCTATCC ATACTGAACC 150 

ACCAACCTTT GCTGAACAAT CTACTTCTGC TGAAGTTCTT GAAACCGGTA 200 

5 TCAAGGTTGT CGATCTTCTT GCTCCTTACG CCCGTGGTGG TAAGATTGGT 250 

CTTTTCGGTG GTGCTGGTGT CGGTAAGACT GTCTTCATTC AAGAACTTAT 3 00 

TAACAACATT GCCAAGGCTC ACGGTGGTTT CTCTGTCTTC ACTGGTGTCG 350 

GTGAAAGAAC CCGTGAAGGT AACGATCTTT ACCGTGAAAT GAAGGAAACT 400 

GGTGTCATCA ACCTCGAAGG TGACTCCAAG GTCGCTCTCG TTTTCGGTCA 450 

10 AATGAACGAA CCTCCAGGTG CCCGTGCCCG TGTCGCTTTG ACTGGTCTTA 500 

CCATTGCCGA ATACTTCCGT GATGAAGAAG GACAAGATGT CTTGCTTTTC 550 

GTTGACAACA TTTTCAGATT CACCCAAGCC GGTTCTGAAG TCTCTGCTCT 600 

TTTGGGTCGT ATTCCATCTG CCGTCGGTTA CCAACCTACC CTTGCTACCG 650 

ATATGGGTGC CCTCCAAGAA CGTATTACCA CCACCCAAAA GGGTTCCGTC 7 00 

15 ACATCTGTCC AAGCCGTCTA TGTCCCAGCA GACGATTTGA CTGATCCTGC 750 

CCCAGCCACC ACTTTCGCTC ACTTGGACGC CACCACTGTC TTGTCTCGTT 800 

CCATTTCCGA ATTGGGTATC TACCCAGCTG TCGATCCTCT CGATTCCAAG 850 

TCTCGTCTTT TGGATCCTGA AGTTATTGGA CACGAACACT ACGAAGTTGC 900 

CACTCAAGTT CAACAAACCC TCCAAGCTTA CAAGTCTCTC CAAGATATCA 950 

2 0 TTGCCATTTT GGGTATGGAT GAATTGTCTG AAGCTGATAA GCTTACTGTC 1000 

GAACGTGCCC GTAAGATCCA AAGATTCCTT TCCCAACCAT TCGCTGTTGC 1050 

CGAAGTTTTC ACTGGTATCG AAGGTCGTCT CGTTCCATTG AAGGAAACCG 1100 

TCAGATCTTT CAAGGAAATC CTTGAAGGTA AGTACGATCA C 1141 

25 



2) INFORMATION FOR SEQ ID NO: 779 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 1093 bases 

— - (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

3 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cladophialophora carrionii 

(B) STRAIN: ATCC 22864 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 79 



40 



GCTGAAGGCC GAACGTGAGC GTGGTATCAC CATCGATATC GCGCTCTGGA 5 0 

AGTTCGAGAC TCCCAAGTAC TTCGTCACCG TCATCGATGC CCCTGGTCAT 100 

45 CGTGAC TTC A TCAAGAACAT GATCACTGGT ACCTCCCAGG CTGATTGTGC 150 

TATTCTCATC ATTGCCGCTG GTACTGGTGA GTTCGAGGCC GGTATCTCCA 200 

AGGATGGCCA GACCCGTGAG CATGCTCTGC TCGCCTACAC CCTGGGCGTG 250 

AAGCAGCTTA TCGTCGCCAT CAACAAGATG GACACCACCA AATGGTCTGA 300 

GGATCGTTTC AACGAAATCA TCAAGGAGAC TTCCAACTTC ATCAAGAAGG 350 

50 TCGGATACAA CCCCAAGTCC GTTCCATTCG TGCCCATCTC CGGCTTCAAC 400 

GGTGACAACA TGATCGACGT CTCCACCAAT GCCCCCTGGT AC AAGGGC TG 450 

GGAAAAGGAG TCCAAGGCTG GCAAGGCCAC CGGCAAGACC CTCCTTGAGG 500 

CTATCGACTC CATCGACCCT CCTGCTCGTC CCACCGACAA GCCTCTCCGT 550 

CTCCCACTCC AGGATGTCTA CAAGATTTCT GGTATCGGCA CGGTGCCCGT 600 

5 5 CGGTCGTGTT GAGACTGGTA CCATCAAGGC CGGTATGGTC GTCACCTTTG 65 0 

CCCCCGCCAA CGTCACCACT GAAGTCAAGT CCGTCGAAAT GCACCACGAA 700 

CAGCTCGCCG AGGGCGTTCC GGGTGACAAC GTCGGCTTCA ACGTCAAGAA 75 0 

CGTCTCCGTG AAGGAGGTTC GTCGTGGAAA CGTTGCTGGT GACTCCAAGA 800 

ACGACCCCCC CAAGGGTGCC GACTCCTTCA ACGCCCAGGT CATCGTCCTC 85 0 

6 0 AACCACCCTG GTCAGGTCGG TGCTGGCTAC GCCCCGGTCT TGGATTGCCA 900 

647 
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CACTGCCCAC ATTGCCTGCA AGTTCTCTGA 
GTCGTACCGG CAAGTCCATG GAAAACAACC 
GATGCTGCCA TCGTGAAGAT GGTTCCCAGC 
TTTCACCGAC TACCCTCCTC TTGGTCGTTT 



GCTCCTCGAG AAGATCGATC 950 

CCAAGTTCAT CAAGTCTGGT 1000 

AAGCCTATGT GCGTTGAGGC 1050 

CGCCGTCCGT GAC 1093 



2) INFORMATION FOR SEQ ID NO : 7 80 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 52 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNE S S : Doub 1 e 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



15 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cunnlnghamella bertholletiae 
20 (B) STRAIN: ATCC 42115 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 80 

TACTTGTAAA TGGTCTCAAG ATCGTTACAA CGAAATTGTT AAGGAAGTTT 50 

25 CTTCCTTCAT CAAGAAGATT GGTTACAATC CTAAATCCGT TCCTTTCGTY 100 

CCTATCTCTG GTTGGCACGG TGATAACATG TTGGAAGCTT CTACCAACAT 150 

GC CTTGGT AC AAGGGATGGA CCAAGGAAAC TAAAGCTGGT TCTTCCACTG 200 

GTAAGACTCT CTTAGAAGCC ATTGACAGCA TTGAACCTCC TACCCGTCCT 250 

TCTGACAAGC CTTTACGTCT TCCTTTACAA GATGTTTACA AGATTGGTGG 3 00 

30 TATTGGTACT GTC CCTGTTG GTCGTGTTGA AACTGGTGTC ATCAAGGCTG 350 

otJtggttgt TACYTTCGCT CCCGCTAACG TCACCACTGA agttaagtcc 400 

GTTGAAATGC ATCACGAACA ATTAGAACAA GGTGTTCCTG GTGACAACGT 450 

TGGTTTCAAC GTCAAGAACG TTTCCGTTAA GGATATCCGT CGTGGTAACG 500 

TCTGTTCCGA CTC C AAGAAC GACCCCGCTA AGGAATCTGC TTCCTTCAAC 550 

35 GCTCAAGTTA TCGTCTTGAA CCACCCTGGT CAAATTGGTG CTGGTTATGC 600 

CCCAGTTCTT GACTGTCACA CTGCTCACAT TGCTTGTAAG TTCGC TGAAT 650 

SaTTAGAAAA GATCGATCGT CGTTCCGGTA AGAAACTCGA AGATGCTCCT 700 

AAATTCGTTA AATCTGGTGA CTCTGCTATC GTTAAGATGG TTCCTTCCAA 750 
GC 

40 

2 ) INFORMATION FOR SEQ ID NO: 781 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 728 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNE SS : Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Curvularia lunata 
55 (B) STRAIN: ATCC 26425 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 781 

CAAGTGGTCT GAGGACCGTT ACCAGGAAAT CATCAAGGAG ACCTCCAACT 50 
60 TCATCAAGAA GGTCGGCTAC AACCCCAAGC ACGTTCCCTT CGTCCCCATC 100 

648 
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TCCGGTTTCA ACGGAGACAA CATGATTGAG GCCTCCACCA ACTGCCCCTG 150 

GTACAAGGGT TGGGAGAAGG AGACCAAGGC CAAGGCCACT GGTAAGACCC 2 00 

TCCTTGAGGC CATCGACGCC ATCGACCCTC CTGTCCGTCC TACCGACAAG 250 

CCCCTCCGCC TTCCCCTCCA GGATGTTTAC AAGATTGGTG GTATTGGCAC 3 00 

5 GGTCCCCGTC GGTCGTGTCG AGACCGGTAT CATCAAGCCC GGTATGGTCG 350 

TCACCTTCGC CCCCGCTGGT GTCACCACCG AAGTCAAGTC CGTCGAGATG 400 

CACCACGAGC AGCTTACTGA GGGTGTCCCC GGTGACAACG TCGGCTTCAA 45 0 

CGTCAAGAAC GTCTCCGTCA AGGAGATCCG TCGTGGTAAC GTTGCCGGTG 500 

ACTCCAAGAA CGACCCCCCC AAGGGTTGCG AGTCCTTCAA CGCCCAGGTC 550 

10 ATCGTCCTCA ACCACCCTGG TCAGGTCGGT GCCGGTTACG CCCCAGTCCT 600 

TGACTGCCAC ACTGCCCACA TTGCCTGCAA GTTCTCCGAG CTCCTCGAGA 65 0 

AGATC G ACCG CCGTACCGGA AAGTCTGTTG AGAACTCCCC CAAGTTCATC 700 

AAGTCCGGTG ACGCCGCCAT CGTCAAGA 72 8 



15 



30 



2 ) INFORMATION FOR SEQ ID NO: 7 82 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1145 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

2 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fonsecaea pedrosoi 

(B) STRAIN: ATCC 18831 



-("xi")— SEQUENCE "DESCRIPTION : SEQ ID NO: 782 



TGAAGTCCGA GCGTGAGCGT GGTATCACCA TCGATATCGC CCTCTGGAAG 50 

TTCGAGACTC CCAAGTAAGG CTCAACAGAC ACAACAAGCA AATGCATACT 100 

35 CGCTAACCTA TTCACCCACC ACAGGTACAA CGTCACCGTC ATTGACGCCC 15 0 

CCGGTCACCG TGATTTCATC AAGAACATGA TCACTGGTAC CTCCCAGGCT 200 

GACTGCGCCA TTCTCATCAT TGCCGCCGGT ACTGGTGAGT TCGAGGCCGG 250 

TATCTCCAAG GACGGTCAGA CCCGTGAGCA CGCTCTTCTC GCCTACACCC 3 00 

TTGGTGTCAA GCAGCTCATC GTTGCCATCA ACAAGATGGA CACCACCAAG 350 

40 TGGTCTGAGG CCCGTTACCA GGAGATCATC AAGGAGACCT CCGGTTTCAT 400 

CAAGAAGGTC GGCTTCAACC CCAAGCACGT TCCCTTCGTG CCCATCTCCG 450 

GTTTCAACGG TGACAACATG ATCGACGTCT CCACCAACTG CCCCTGGTAC 500 

AAGGGTTGGG AGAAGGAGAC CAAGGCCAAG GCCACCGGCA AGACCCTCCT 550 

CGAGGCCATT GACGCCATCG ACCCCCCCAC TCGTCCCACC GACAAGCCCC 600 

45 TCCGTCTTCC CCTYCAGGAT GTCTACAAGA TCGGTGGTAT TGGCACGGTT 650 

CCCGTCGGTC GTGTCGAGAC CGGTACCATC AAGGC CGGCA TGGTCGTCAC 7 00 

CTTCGCCCCC GCTGGTGTCA CCACTGAGGT CAAGTCCGTC GAGATGCACC 750 

ACGAGCAGCT TCCCGAGGGT CTCCCCGGTG ACAACGTCGG CTTCAACGTC 800 

AAGAACGTCT CCGTCAAGGA GATC CGTCGT GGCAACGTCG CCGGTGACTC 850 

50 CAAGAACGAC CCCCCCAAGG GCTGCGACAG CTTCAACGCC CAGGTCATCG 900 

TCCTCAACCA CCCCGGTCAG GTCGGCGCCG GCTACGCGCC CGTCCTCGAC 950 

TGCCACACTG CTCACATTGC TTGCAAGTTC TCTGAGCTCC TCGAGAAGAT 1000 

CGACCGCCGT ACCGGCAAGT CCATTGAGGC CAGCCCCAAG TTCATCAAGT 1050 

CTGGTGACGC CGCCATCGTC AAGATGGTTC CCTCCAAGCC TATGTGCGTT 1100 

55 GAGGCCTTCA CCGACTACCC CCCTCTTGGA CGTTTCGCCG TCCGT 1145 



60 



2 ) INFORMATION FOR SEQ ID NO: 783 



649 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1151 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Microsporum audouiniz 

(B) STRAIN: ATCC 11347 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 83 

15 GCTCAAGGCT GAGCGTGAGC GTGGTATCAC CATTGACATC GCCCTCTGGA 50 

AGTTCGAGAC CCCCAAGTAC ATGGTCACCG TCATCGGTAT GCTTTATCTG 100 

TTTCCCATTT ATAGTTGCGA CAAGTAACTA ATAAAAAGTA GATGCCCCCG 150 

GACACCGTGA CTTCATCAAG AACATGATTA CTGGTACCTC CCAGGCCGAC 2 00 

TGCGCTATTC TCATCATTGC TGCCGGTACT GGTGAGTTCG AGGCTGGTAT 250 

2 0 CTCCAAGGAT GGCCAGACTC GTGAGCACGC CCTGCTCGCT TTCACCCTCG 300 

GTGTCAAGCA GCTCATCGTT GCCATCAACA AGATGGACAC CACCAACTGG 3 50 

TCTGAGTCCC GTTTCGGTGA AATCATCAAG GAAGTCACCA ACTTCATCAA 4 00 

GAAGGTCGGC TACGACCCCA AGGGTGTCCC ATTCGTCCCA ATCTCTGGCT 45 0 

TCAACGGTGA CAACATGATT GAGCCCTCCA CCAACTGCCC ATGGTAC AAG 500 

25 GGATGGAACA AGG AG AC C AA GGCCGGTGGC AAATCCTCTG GTAAGACCCT 550 

CCTTGAGGCC ATCGATGCCA TTGACATGCC CACTCGTCCC ACCGACAAGC 600 

CTCTCCGTCT CCCACTCCAG GATGTC TAC A AGATCTCTGG TATCGGAACA 650 

GTACCAGTCG GTCGTGTTGA G AC TGGTATC ATCAAGCCTG GTATGGTTGT 7 00 

CACTTTCGCC CCCGCCAACG TC AC C AC TG A AGTCAAGTCC GTCGAAATGC 750 

30 ACCACCAGCA GCTCGTTCAG GGTGTTCCCG GTGACAACGT TGGCTTCAAC 800 

GTCAAGAACG TCTCTGTCAA GGAAGTCCGC CGTGGTAACG TTGCCGGTGA 850 

TTCCAAGAAC GACCCACCAT CTGGCTGCGC CTCTTTCAAG GCCCAGGTCA 900 

TCGTCCTCAA CCACCCCGGC CAGATCGGTG CTGGTTACGC CCCAGTCCTC 950 

GACTGCCACA CTGCCCACAT TGCTTGCAAG TTCTCTGAGC TTCTTGAGAA 1000 

35 GATTGACCGC CGTACTGGTA AATCCGTCGA AACCAGCCCT AAGTTCGTCA 1050 

AGTC TGGTGA TGCCGCTATT GCCACCATGG TTCCATCCAA GCCCATGTGC 1100 

GTTGAGGCTT TCACTGACTA CCCACCACTT GGTCGTTTCG CCGTCCGTGA 115 0 

C 1151 



40 



55 



2) INFORMATION FOR SEQ ID NO: 784 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 979 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Afucor circinelloides 

(B) STRAIN: ATCC 38592 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 84 



CCAAGTGGTC TCAAGATCGT T AC AAC G AAA TTGTCAAGGA AGTTTCCGGT 50 

TTCATCAAGA AGATCGGTTT CAACCCCAAG TCCGTTCCTT TCGTTCCCAT 100 

60 TTCTGGCTGG CACGGTGATA ACATGTTGGA TGAATCCACC AACATGCCCT 150 

650 
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GGTTCAAGGG ATGGAACAAG GAGACCAAGG CCGGTTCCAA GACTGGTAAG 200 

ACTCTCCTCG AAGCCATCGA TGCCATTGAG CCCCCTGTCC GTCCTTCTGA 250 

CAAGCCTCTC CGTCTTCCTC TTCAAGATGT CTACAAGATT GGTGGTATTG 3 00 

GTACAGTTCC CGTCGGTCGT GTTGAAACTG GTACTATCAA GGCTGGTATG 350 

5 GTTGTCAACT TCGCTCCCGC TGCTGTCACC ACTGAAGTTA AGTCCGTTGA 400 

AATGCATCAC GAAACCCTCT CTGAAGGTCT CCCCGGTGAC AACGTTGGTT 450 

TCAACGTCAA GAACGTCTCC GTCAAGGATA TCCGTCGTGG TAACGTCTGT 500 

TCCGACTCCA AGAACGATCC CGCTAAGGAA TCTGCCTCTT TCACTGCTCA 55 0 

AGTTATTATC TTGAACCATC CCGGTCAAAT CTCTGCTGGT TACGCACCAG 600 

10 TTCTCGATTG TCACACTGCT CACATCGCCT GTAAGTTCTC TGAACTCATT 650 

GAGAAGATTG ATCGTCGTTC CGGTAAGTAC CTGCATCTGT CAGAATTGAA 7 00 

GGTCCGCCGT TATAGCAAAG GCTGGGTTTA AATGTTGGGG TTTGTCTGAT 750 

C T AT AATG AT GATTGCTCCT TCAATTTTTG ACATAATTTG ATGATCTGAA 8 00 

TTGTGTTGCT AACGTCGCAT TTGC TTCTTT TGCTTCTTTT GCATGTAGGT 850 

15 AAGAAGATGG AAGATGCTCC CAAAGTAAGT ATTACGATTG ATGGACAATT 900 

AAAATAGAAT ACTAACAATT ATTGTTTATA GTTCGTCAAG TCTGGTGACT 9 50 

CTGCTATCGT CAAGATGGTT CCCTCCAAG 979 



20 



30 



2 ) INFORMATION FOR SEQ ID NO: 785 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1099 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Phialaphora verrucosa 

(B) STRAIN: ATCC 3 8561 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 785 

GGACAAGCTG AAGGCCGAAC GTGAGCGTGG TATCACCATC GATATCGCGC 50 

TCTGGAAGTT CGAGACTCCC AAATACTTCG TCACCGTCAT TGATGCCCCT 100 

GGTCATCGTG ACTTCATCAA GAACATGATC ACTGGTACCT CCCAGGCTGA 15 0 

40 TTGTGCCATC CTCATCATTG CCGCCGGTAC CGGTGAGTTC GAAGCCGGTA 2 00 

TCTCCAAGGA TGGCCAGACC CGTGAGCACG CTCTCCTCGC CTACACCCTA 2 50 

GGTGTGAAGC AGC TTATCGT CGCCATCAAC AAGATGGACA CCGCCAAATG 3 00 

GTCCGAGGAT CGGTTCAACG AAATCATCAA GGAGACTTCC AACTTCATCA 35 0 

AGAAGGTCGG ATACAACCCC AAGTCCGTCC CGTTCGTGCC CATCTCCGGT 400 

45 TTCAACGGTG ACAACATGAT CGACGTCTCC TCCAACGCCC CCTGGTACAA 45 0 

GGGTTGGGAG AAGGAGACCA AGGCCGGCAA GGCCACTGGC AAGACCCTCC 500 

TCGAGGCCAT CGACGCGATT GACCCTCCTA CTCGTCCCAC CGACAAGCCC 550 

CTCCGTCTCC CTCTCCAGGA TGTCTACAAG ATCTCTGGTA TCGGCACGGT 600 

GCCCGTTGGT CGTGTTGAGA CCGGTACCAT CAAAGCCGGT ATGGTCGTCA 65 0 

50 CCTTCGCTCC CGCCAACGTC ACCACTGAAG TCAAGTCCGT CGAAATGCAC 7 00 

CACGAACAGC TCGCCGAAGG TGTTCCAGGT GACAATGTCG GCTTCAACGT 750 

CAAGAACGTC TCCGTCAAGG AGGTTCGTCG TGGAAACGTT GCCGGTGACT 800 

CCAAGAATGA CCCCCCCAAG GGTGCCGACT CCTTCAACGC TCAGGTCATC 850 

GTCCTCAACC ACCCTGGTCA GGTCGGTGCC GGCTACGCCC CGGTCTTGGA 900 

55 TTGCCACACT GCCCACATTG CTTGCAAGTT CTCTGAGCTC CTCGAGAAGA 950 

TCGATCGTCG TACCGGCAAG TCCATGGAAA ACAACCCCAA GTTTATCAAG 1000 

TCTGGTGATG CTGCCATCGT GAAGATGGTT CCCAGCAAGC CTATGTGCGT 1050 

TGAGGCCTTC ACCGACTATC CTCCTCTTGG TCGTTTCGCC GTCCGTGAC 1099 

60 
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2 ) INFORMATION FOR SEQ ID NO: 786 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 750 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saksenaea vasiformis 

(B) STRAIN: ATCC 60625 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 786 



ACCACCAAGT GGTCTGAGGC CCGTTACCAG GAGATCATCA AGGAGACCTC 50 

CGGYTTCATC AAGAAGGTCG GCTTCAACCC CAAGCACGTT CCCTTCGTGC 100 

2 0 CCATCTCCGG TTTCAACGGT GACAACATGA TCGACGTCTC CACCAACTGC 150 

CCCTGGTACA AGGGTTGGGA GAAGGAGACC AAGGCCAAGG CCACCGGCAA 200 

GACCMTCCTC GAGGCCATTG ACGCCATCGA CCCCCCYAGY CGTCCCACCG 2 50 

ACAAGCCCCT YCGTCTTCCC CTMCAGGATG TYTACAAGAT TGGCGGTATT 3 00 

GGCACGGTTC CCGTCGGTCG TGTYGAGACC GGTRCCATCA AGGGTGGCAT 350 

2 5 GGTCGTCACC TTCCCCCCCG CTGGTGTCAC CACTGAGGTC AAGTCCGTCG 400 

AGATGCACCA CGAGCAGCTC GCCGAGGGTS TCCCCGGTGA CAACGTCGGC 450 

TTCAACGTCA AGAACGTCTC CGTCAAGGAG ATCCGTCGTG GCAACGTTGC 5 00 

CGGTGACTCC AAGAACGACC CCCCCAAGGG CTGCGACAGC TTCAACGCCC 550 

AGGTCATCGT CCTCAACCAC CCCGGTCAGG TCGGCGCCGG CTACGCGCCS 600 

3 0 GTCCTSGACT GCCACACTGC TCACATTGCC TGCAAGTTCT CTGAGCTCCT 650 

-CGAGAAGATC GACCGCCGTT CCGGCAAGTC "CATCGAGTCC GGCCCCAAGT 7 00 

TCATCAAGTC TGGTGACGCC GCCATCGTCA AG ATGGTTC C CTCCAAGCCC 7 50 



35 



45 



2 ) INFORMATION FOR SEQ ID NO: 787 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1084 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D ) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sync ephal a strum racemosum 

(B) STRAIN: ATCC 32330 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 787 

GCTCAAGGCC GAGCGTGAGC GTGGTATCAC CATCGATATC GCTCTCTGGA 50 

AGTTCGAGAC CCCCAAGTAC CACGTCACCG TCATTGATGC CCCCGGCCAT 100 

CGTGATTTCA TCAAGAACAT GATCACTGGT ACCTCGCAGG CTGACTGCGG 150 

55 TATCCTCATC ATTGCCGCCG GTACTGGTGA GTTCGAGGCT GGTATCTCCA 2 00 

AGGATGGCCA GACCCGTGAG CACGCTCTGC TTGCCTTCAC CCTCGGTGTC 250 

CGTCAGCTGA TCGTCGCCAT CAACAAGATG GACTCGACCA AGTACTCTGA 3 00 

GGCCCGTTAC AACGAAATCG TCAAGGAGGT CTCCACCTTC ATCAAGAAGA 350 

TCGGTTTCAA CCCCAAGTCC GTTCCCTTCG TCCCCATCTC TGGCTGGAAC 400 

60 GGTGACAACA TGTTGGAGGA GTCCTCCAAC ATGCCCTGGT TCAAGGGCTG 450 
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GAAGAAGGAG ACCAAGGCTG GCGAGAAGTC CGGCAAGACC CTCCTTGAGG 500 

CCATTGACAA CATTGACCCC CCGGTCCGTC CCTCGGACAA GCCCCTCCGT 550 

CTTCCCCTCC AGGATGTCTA CAAGATCGGT GGTATCGGCA CAGTCCCCGT 600 

CGGTCGTGTC GAGACTGGTG TCATCAAGGC TGGTATGGTC GTGACCTTCG 650 

5 CCCCCGCCAA CGTCACCACT GAAGTCAAGT CCGTCGAGAT GCACCACGAG 7 00 

CAGCTCGTCG AGGGTGTCCC CGGTGACAAC GTCGGTTTCA ACGTCAAGAA 750 

CGTTTCCGTC AAGGATATCC GCCGTGGTAA CGTCTGCTCT GACTCCAAGA 800 

ACGACCCCGC CAAGGAGTCT GCCTCGTTCA CCGCCCAGGT CATCGTCCTG 85 0 

AACCACCCCG GTCAGATCGG TGCCGGTTAC GCCCCGGTTC TTGACTGCCA 900 

10 CACCGCTCAC ATTGCCTGCA AGTTCGCTGA GCTCCTCGAG AAGATCGACC 950 

GTCGTTCCGG YAAGAAGCTC GAAGAGTCCC CCAAGTTCGT CAAGTCGGGT 1000 

GACTCCGCCA TCGTCAAGAT GGTTCCCTCC AAGCCCATGT GCGTTGAGGC 105 0 

CT AC AC TG AG TACCCCCCTC TTGGCCGTTT CGCC 1084 



15 
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2 ) INFORMATION FOR SEQ ID NO: 788 



(i) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 1155 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton tonsurans 

(B) STRAIN: ATCC 56185 

(xi) "SEQUENCE 'DESCRIPTION: SEQ ID NO : 788 



GCTCAAGGCC GAGCGTGAGC GTGGTATCAC CATCGATATC GCCCTCTGGA 5 0 

AGTTC GAG AC CCCCAAGTAC AATGTC AC CG TCATTGGTAT GTTTTTCTTT 10 0 

35 ACCTTTCCCC TCCATCGTCT TGCTGTGCCA TAACTAACGA GAGTAGACGC 15 0 

CCCCGGTCAC CGTGACTTCA TCAAGAACAT GATCACTGGT ACCTCCCAGG 200 

CTGACTGTGC TATTCTCATC ATTGCTGCCG GTACTGGTGA GTTCGAGGCT 25 0 

GGTATC TCC A AGGATGGCCA GACCCGTGAG CACGCTCTGC TCGCCTTCAC 300 

CCTTGGTGTC AAGCAGCTCA TCGTTGCCAT CAACAAGATG GACACCACCA 350 

40 ACTGGTC CG A GGACCGTTTC AAGGAAATCA TCAAGGAAGT CACCAACTTC 400 

ATCAAGAAGG TTGGCTACGA CCCCAAGGGT GTTCCATTCG TTCCAATCTC 45 0 

TGGTTTCAAC GGTGACAACA TGATTGAGGC CTCCAGCAAC TGCCCATGGT 500 

ACAAGGGATG GAACAAGGAG ACCAAGGCCG GTGGTGCCAA GACTGGCAAG 55 0 

ACCCTYCTCG AGGCCATCGA TGCCATCGAC ATGCCAACCC GTCCTACCGA 600 

45 CAAGCCCCTY CGTCTCCCAC TCCAGGATGT CTACAAGATC TCTGGTATCG 65 0 

GAACTGTACC AGTCGGTCGT GTTGAGACCG GTATCATCAA GCCTGGTATG 7 00 

GTCGTCACCT TCGCCCCTGC CAACGTCACC ACTGAAGTCA AGTCCGTCGA 75 0 

AATGCACCAC CAGCAGCTTC AGCAGGGTGT CCCCGGTGAC AACGTCGGCT 800 

TCAACGTCAA GAACGTTTCC GTCAAGGAAG TCCGCCGTGG TAACGTTGCC 85 0 

50 GGTGACTCCA AGAACGACCC ACCATCCGGC TGTGCCTCCT TCAACGCCCA 900 

GGTCATCGTC CTCAACCACC CCGGCCAGAT CGGTGCTGGT TACGCCCCAG 95 0 

TCCTCGACTG CCACACTGCT CACATTGCTT GCAAGTTCGC TGAGCTCCTC 1000 

GAGAAGATTG ACCGCCGTAC CGGTAAATCC GTCGAAGCCA ACCCCAAGTT 105 0 

CGTCAAGTCT GGTGATGCCG CTATCGCCAA GATGGTTCCC TCCAAGCCCA 1100 

55 TGTGCGTTGA GGCTTTCACT GACTACCCCC CACTTGGTCG TTTCGCCGTC 1150 

CGTGA 1155 



60 2 ) INFORMATION FOR SEQ ID NO: 7 89 

653 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1138 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: ATCC 8125 



15 



40 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 789 



TCAAGGCCGA GCGTGAGCGT GGTATCACCA TCGATATCGC CCTCTGGAAG 50 

TTCGAGACCC CCAAGTACAA TGTCACCGTC ATTGGTATGT TTCTCTTTAC 100 

CTTTCCCCTC CATCGTCTTG CTGTGCCATA ACTAACGAGA GTAGACGCCC 150 

CCGGTCACCG TGACTTCATC AAGAACATGA TCACTGGTAC CTCCCAGGCT 2 00 

20 G AC TGTGCT A TTCTCATCAT TGCTGCCGGT ACTGGTGAGT TCGAGGCTGG 250 

TATCTCCAAG G ATGGC C AG A CCCGTGAGCA CGCTCTGCTC GCCTTCACCC 3 00 

TTGGTGTCAA GCAGCTCATC GTTGCCATCA ACAAGATGGA CACCACCAAC 3 50 

TGGTCCGAGG ACCGTTTCAA GGAAATCATC AAGGAAGTCA CCAACTTCAT 400 

CAAGAAGGTT GGCTACGACC CCAAGGGTGT TCCATTCGTT CCAATCTCTG 450 

2 5 GTTTCAACGG TGACAACATG ATTGAGGCCT CCACCAACTG CCCATGGTAC 500 

AAGGGATGGA ACAAGGAGAC CAAGGCCGGT GGTGCCAAGA CTGGCAAGAC 550 

CCTCCTCGAG GCCATCGATG C CATC G AC AT GCCAACCCGT CCTACCGACA 6 00 

AGCCCCTCCG TCTCCCACTC CAGGATGTCT ACAAGATCTC TGGTATCGGA 650 

ACTGTACCAG TCGGTCGTGT TG AG AC CGGT ATCATCAAGC CCGGTATGGT 7 00 

3 0 CGTCACCTTC GCCCCTGCCA ACGTCACCAC TGAAGTCAAG TCCGTCGAAA 7 50 

TGCACCACCA GCAGCTTCAG "CAGGGTGTCC CCGGTGACAA CGTCGGCTTC 800 

AACGTCAAGA ACGTTTCCGT CAAGGAAGTC CGCCGTGGTA ACGTTGCCGG 850 

TGACTCCAAG AACGACCCAC CATCCGGCTG TGCCTCCTTC AACGCCCAGG 9 00 

TCATCGTCCT CAACCACCCC GGCCAGATCG GTGCTGGTTA CGCCCCAGTC 950 

35 CTCGACTGCC ACACTGCTCA C ATTGC TTGC AAGTTCGCTG AGCTCCTCGA 1000 

GAAGATTGAC CGCCGTACCG GTAAATCCGT CGAAGCCAAC CCCAAGTTCG 1050 

TCAAGTCTGG TGATGCCGCT ATCGCCAAGA TGGTTCCATC CAAGCCTATG 1100 

TGCGTTGAGG CTTTCACTGA CTACCCCCCA CTTGGTCG 1138 



2 ) INFORMATION FOR SEQ ID NO: 790 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 748 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bipolaris hawaliensis 

(B) STRAIN: ATCC 26067 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 790 



CACCACCAAG TGGTCTGAGG AGCGTTACCA GGAAATCATC AAGG AG AC CT 5 0 

CCAACTTCAT CAAGAAGGTC GGCTACAACC CCAAGCACGT TCCCTTCGTC 100 

60 CCCATCTCTG GTTTCAACGG AGACAACATG ATTGAGGCTT CCACCAACTG 150 
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CCCCTGGTAC AAGGGTTGGG AGAAGGAGAC 
AGACCCTTCT CGAGGCCATC GACGCCATCG 
GACAAGCCCC TCCGCCTTCC CCTCCAGGAT 
TGGCACGGTT CCCGTCGGTC GTGTCGAGAC 
5 TGGTCGTCAC CTTCGCCCCC GCTGGTGTCA 
GAGATGCACC ACGAGCAGCT TGCCGAGGGT 
CTTCAACGTC AAGAACGTCT CCGTCAAGGA 
CCGGTGACTC CAAGAACGAC CCCCCCAAGG 
CAGGTCATCG TCCTCAACCA CCCCGGTCAG 
10 AGTCCTTGAC TGCCACACTG CCCACATTGC 
TCGAGAAGAT TGACCGCCGT ACCGGAAAGT 
TTCATCAAGT CCGGTGACGC CGCCATCGTC 



CAAGGCCAAG GCCACTGGTA 2 00 

ACCCCCCTGT CCGTCCTACC 2 50 

GTGTACAAGA TTGGTGGTAT 3 00 

CGGTATCATC AAGCCCGGTA 3 50 

CCACTGAAGT CAAGTCCGTC 4 00 

GTCCCCGGTG ACAACGTCGG 4 50 

GATCCGTCGT GGTAACGTTG 5 00 

GTTGCGAGTC CTTCAACGCC 5 50 

GTCGGTGCCG GTTACGCACC 600 

TTGCAAGTTC TCCGAGCTCC 650 

CTGTTGAGAA CTCCCCCAAG 7 00 

AAGATGGTTC CCTCCAAG 74 8 



15 



2 ) INFORMATION FOR SEQ ID NO : 791 



20 



25 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 958 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: ATCC 14110 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 791 



CGC TATTGTC 
TTGGGAAATA 
CAGACTCGTG 

35 CGTTGTCTTC 
AACTGGTCGA 
GGTGAAGAGA 
ACGCCGTGAC 
TCGACACCTG 

40 ATGTCTGTCG 
TGGTCGTGTC 
TTGGAGGCTC 
TTCAAGAAGA 
CCTGCGTGGT 

45 TTCCCGGCAG 
CTGACCGAGG 
TCCCCAAGTC 
GATCTTCCGA 
GCTGACCTCA 

50 TGACAACGTC 
CTGGTCAA 



GTTGTTGCTG 
CGTCGTAAAA 
AGCATTTGCT 
GTCAACAAAA 
ACTCGAGATG 
CTCCGATCAT 
GACATCGGTA 
GATCCCCACT 
AGGAAGTGTT 
GAGCGTGGTA 
CTTCGAACCC 
GCTGTGATGA 
ATCCGACGTG 
CACCAAAGCT 
CGGAGGGTGG 
TTCATCCGTA 
GAGATTAGCG 
GCTTCCCTGA 
GAGATGATCC 



CCTCCGACGG 
CACGGCGCTT 
GCTCGCCCGC 
TCGATGCTAT 
CGTGAGCTGC 
TTTCGGTTCC 
AAGACAGAAT 
CCTCAGCGTG 
CTCTATCGCC 
TCTTGAAGAA 
AAGAAGACCA 
ATCGCGTGCT 
AAGACGTCAA 
CACGACAAGT 
TCGTCGT ACT 
CTGCAGGTAA 
AT ATATGC T A 
CGGCGACCAA 
TGAAGACCCA 



TCAGATGTAG 
ACGTTTTCGC 
CAGGTTGGTG 
TGATGATCCG 
TGAACAGCTA 
GCTCTCTGTG 
TGAGCAGCTT 
ACCTCGACAA 
GGCCGTGGTA 
GGACTC TGAG 
AAGTCACCGA 
GGTGACAACT 
GCGTGGTATG 
TCCTCGTCTC 
GGCTTCGGTG 
GTTCCCGCAC 
ATGATTCATC 
TCTCGCAGAG 
CCACCCTGTT 



GTGGAACATC 5 0 

GAATAGGCCC 100 

TCCAGAAGAT 15 0 

GAGATGCTGG 2 00 

CGGTTTCGAG 25 0 

CTCTCGAAGG 3 00 

ATGAACGCTG 35 0 

ACCTTTCTTG 4 00 

CCGTGGC TTC 4 50 

GTTGAGATTG 5 00 

CATTGAAACC 5 50 

CTGGTCTCCT 600 

GTCATTGCTG 650 

CATGTACGTC 7 00 

CCAACTACCG 7 50 

ACCGTGTCCA 800 

AGACGAGGCT 850 

TTATGCCTGG 9 00 

GCTGCTGAGG 950 
958 



55 2 ) INFORMATION FOR SEQ ID NO: 792 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 936 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: ATCC 8125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 792 



J-UOnVJ ^W^^J"^ * ^W**. ._w~r 

GATGATTTGC AAGACCCTCC ACCCCATTGC TGCCGA 



CGCCCAGACG 


50 


AGACCAGAGA 


100 


GTCGTTTTCG 


150 


GCTTGTCGAA 


200 


GTGAGGAGAC 


250 


CGTCGACCTG 


300 


C G AC ACCTGG 


350 


TGTCCATTGA 


400 


GGTCGTGTTG 


450 


TGGTGGCTCT 


500 


TCAAGAAGTC 


550 


CTCCGAGGTA 


600 


CCCCGGATCC 


650 


TGACTGAGGC 


700 


CCCCAAATGT 


750 


TACTAAGTAG 


800 


ATCTTTCAGC 


850 


ACAACGTCGA 


900 




936 



2 ) INFORMATION FOR SEQ ID NO: 793 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

( D ) TOPOLOGY : Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 793 

4 5 TTATTGTTGC TGC TGGTACT 



50 



2 ) INFORMATION FOR SEQ ID NO: 794 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Trypanosoma cruzi 

656 



CA 02307010 2000-05-19 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 794 
GACGACAAGT CGGTGAACTT 2 0 

5 

2 ) INFORMATION FOR SEQ ID NO: 795 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



15 



20 



25 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Txypanosoma cruzi 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 795 
ACTTGCACGC GATGTGGCAG 2 0 

2 ) INFORMATION FOR SEQ ID NO: 796 

(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 21 bases 

(B) "TYPEl~Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 796 
GGTCCAATGC CWCAAACWAG A 21 

40 

2 ) INFORMATION FOR SEQ ID NO : 797 

45 (i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 797 
55 CATTAAGAAT GGYTTATCTG TSKCTCT 27 



2 ) INFORMATION FOR SEQ ID NO: 798 

60 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 798 

TGGTTGTCCC AGCCGATCGT TT 



2 ) INFORMATION FOR SEQ ID NO: 799 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 799 

ACCTGTGAAT ACAAGCAATC T 



2 ) INFORMATION FOR SEQ ID NO : 800 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: single 

(D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 800 

GATGAAATCT TCAACGAAGT TGAT 



2 ) INFORMATION FOR SEQ ID NO: 801 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

658 
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50 



(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 801 

ACAACACCGA GAAGATCCCA 2 0 

2 ) INFORMATION FOR SEQ ID NO: 802 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 802 

TTGCCATTTC TGGTTTCGTT 2 0 

2 ) INFORMATION FOR SEQ ID NO: 803 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 803 
ACTTCAGTGG TAACACCAGC 20 

2) INFORMATION FOR SEQ ID NO: 804 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: DNA 

659 
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(vi) ORIGINAL, SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 804 

CCTGGGACGG CCTCTGGCAT 



2 ) INFORMATION FOR SEQ ID NO: 805 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 805 

CTCTTGTCCA TCTTAGCAGT 



2 ) INFORMATION FOR SEQ ID NO: 806 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 806 

AGCATCACCA GACTTGATAA G 



2 ) INFORMATION FOR SEQ ID NO: 807 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

( D ) TOPOLOGY : L inear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

660 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 807 
AAAGTGGCTT CAAAGGTTGC 2 0 

5 

2 ) INFORMATION FOR SEQ ID NO: 808 

10 (i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 808 
2 0 GCITTAIWRG CATTAGAARA YCCA 2 4 



25 



55 



2 } INFORMATION FOR SEQ ID NO: 809 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 809 

35 

TCTTCCTGTW GCAACTGTTC CTCT 24 



40 2 ) INFORMATION FOR SEQ ID NO : 810 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 810 
50 AGAGMWACAG ATAARSCATT CTTA 24 



2 ) INFORMATION FOR SEQ ID NO : 811 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
60 (D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 811 
TRAARTAGAA TTGTGGTCTR TATCC 25 



10 2 ) INFORMATION FOR SEQ ID NO: 812 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 812 

GTIACIGGIT CYTYRARRTT ICCICC 26 

25 

2 ) INFORMATION FOR SEQ ID NO : 813 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
30 (B) TYPE: Nucleic acid 

— PC) STRANDEDNESS: Single 

( D ) TOPOLOGY : Linear 



35 



40 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 813 
AATCYGTY GA AATGCAYCAC GA 22 

2 ) INFORMATION FOR SEQ ID NO: 814 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

( D ) TOPOLOGY : Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 814 
GCIGGCACGT ACACIGCCTG 20 

55 

2 ) INFORMATION FOR SEQ ID NO : 815 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION:* SEQ ID NO : 815 
10 TGGTGCATYT CKACRGACTT 20 



15 



2 ) INFORMATION FOR SEQ ID NO : 816 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 816 

25 

GCTACGACGA GATCAAGGGC 2 0 



3 0 . 2 ) INFORMATION FOR SEQ ID NO: .817 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 817 

TGGAAGAAGG CCGAGGAGTT 2 0 



45 



55 



60 



2 ) INFORMATION FOR SEQ ID NO : 818 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 bases 
5 0 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 818 
AGCCGGGCTG GATCTTCTTC 2 0 



663 
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2 ) INFORMATION FOR SEQ ID NO: 819 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 819 
TCGAGCTTCT GGAGGAAGAG 



2 ) INFORMATION FOR SEQ ID NO: 820 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 820 

GAAGGAGGTG TCTGCTTACA C 



2 ) INFORMATION FOR SEQ ID NO: 821 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 821 

GGCGCAAACG TCACCACATC A 



2 ) INFORMATION FOR SEQ ID NO: 822 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
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10 



30 



40 



45 



(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 822 

CGGCGGATGT CCTTAACAGA A 21 



15 2 ) INFORMATION FOR SEQ ID NO: 823 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82 3 

GAGCGGTATG AYGAGATTGT 2 0 



2) INFORMATION FOR SEQ ID NO: 824 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

( D ) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 824 
GGCTTCTGCG GCACCATGCG 2 0 

2 ) INFORMATION FOR SEQ ID NO: 825 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

5 5 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 82 5 
ATGAGCARCG SAACCATCGT TCAGTG 26 

60 
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2) INFORMATION FOR SEQ ID NO: 82 6 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

( D ) TOPOLOGY : Linear 

10 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 826 
15 TCGATCGTGC CGACCATGTA GAACGC 



2 ) INFORMATION FOR SEQ ID NO: 827 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 446 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Clostridium novyi 

(B) STRAIN: ATCC 19402 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 827 

3 5 CACCAACTTG CTAAATGGGG AG ATGCC C AG ATTGTTGTAT ATATAGGCTG 50 

TGGAGAACGT GGAAATGAAA TGACAGATGT TCTTAATGAG TTTCCAGAAC 100 

TTAAAGATCC TAAGACTGGC AAATCAATAA TGGAAAGAAC AGTTTTAATA 150 

GCAAATACTT CTAATATGCC AGTTGCAGCC CGTGAAGCTT GTATATATAC 2 00 

AGGAATCACA ATAGCAGAAT ATTTTAGAGA TATGGGATAT TCAGTAGCAC 2 50 

40 TTATGGCGGA TTCCACTTCA CGTTGGGCAG AGGCATTAAG AGAAATGTCT 300 

GGAAGACTTG AAGAAATGCC TGGTGATGAA GGTTACCCAG CTTATTTAGG 3 50 

ATCAAGACTT GCTGATTTCT ATGAAAGAGC TGGAAAAGTT GTGTGTTTAG 400 

GAGACGATGA AAGAGAAGGT GCCATTACTG CAATAGGTGC TGTATC 446 



45 



60 



2 ) INFORMATION FOR SEQ ID NO : 82 8 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 445 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: 9689 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 828 

CAGCATCAGC TTGCTAAATG GGCAGATGCA GATATAGTTG TATATATAGG 50 

CTGTGGCGAG CGTGGAAATG AAATGACAGA TGTTCTTCTT GAATTTCCTG 100 

5 AATTAAAAGA CCCAAGAACA GGCGAGTCAC TTATGCAAAG AACTGTGCTT 150 

ATAGCAAATA CATCAGATAT GCCGGTTGCT GCACGTGAAG CTTCTATATA 200 

CACTGGTATT ACAATAGCTG AATATTTTAG AGATATGGGA TATAGTGTTG 250 

CACTTATGGC AGACTCTACA TCAAGATGGG CTGAGGCTCT TAGAGAGATG 300 

AGTGGTCGTT TAGAGGAGAT GCCTGGTGAA GAAGGTTATC CTGCATACTT 350 

10 AGGTTCACGT CTTGCTCAAT TCTATGAGAG AGCAGGAAAG GTAAATTGTC 400 

TAGGTATGGA TGAAAGAGAA GGAACACTTA CAGCAATTGG TGCAG 445 



15 2) INFORMATION FOR SEQ ID NO: 829 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 445 bases 

(B) TYPE: Nucleic acid 

2 0 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium septicum 

(B) STRAIN: ATCC 12464 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 829 



ATGC TATAGC TAAATGGGGA GACAGCGAAA TAGTTGTTTA CGTTGGATGT 50 

GGAGAACGTG GTAACGAAAT GACAGACGTT CTTAACGAAT TCCCAGAACT 100 

TATTGACCCA AAAAC TGGGG AAAGTTTAAT GAAGAGAACA GTAC TTAT AG 150 

CTAATACTTC AAAC ATGC C A GTTGCTGCTA GAGAAGCTTG CATATACACA 2 00 

3 5 GGTATTACAA TAGCTGAATA CTTCAGAGAT ATGGGATACT CAGTATCTAT 250 

AATGGC TG AT TCAACTTCAA GATGGGCAGA AGCATTAAGA GAAATGTCAG 300 

GTAGACTTGA AG AAATGC C A GGTGATGAAG G ATATC C AG C GTAC TTAGGA 35 0 

TCAAGACTTG CTGATTATTA CGAAAGAGCA GGTAAGGTTG TTTGTCTAGG 400 

TAAAGATGGT AGAGAAGGTG CTGTAACAGC AATTGGAGCT GTATC 44 5 

40 



2 ) INFORMATION FOR SEQ ID NO: 83 0 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium botulinum 
55 (B) STRAIN: 20:3.1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 0 

TCAAATTGCT AAATGGGGAG ATGCAGAAAT CGTTGTTTAC GTTGGATGCG 50 
60 GAGAACGTGG TAACGAAATG ACAGACGTTG TTAATGAGTT CCCAGAACTT 100 
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ATTGACCCTA AGACTGGCGA AAGCTTAATG 
TAATACTTCA AACATGCCAG TTGCAGCGAG 
GTATCACAAT AGCTGAATAT TTCAGAGATA 
ATGGCTGACT CAACTTCAAG ATGGGCTGAG 
5 TAGACTTGAA GAAATGCCTG GTGATGAAGG 
CAAGACTTGC TGATT AC TAT GAAAGAGCTG 
AATGATGGAA GAATTGGTTC TATAACAGCA 



AAGAG AAC AG TTCTTATAGC 150 

AGAAGCTTCA ATATATACAG 200 

TGGGATATGC AGTATCAATA 25 0 

GC ATT AAGAG AAATGTCTGG 300 

ATATCCAGCT TACCTTGGAT 350 

GTAAGGTTGA ATGTTTAGGT 400 

ATCGGTGCGG TATC 444 



10 2 ) INFORMATION FOR SEQ ID NO: 831 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 456 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium perfringens 

(B) STRAIN: ATCC 13124 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 831 



TAGTTCAGCA CCAAGTTGCT AAATGGGGAG ATACTGAGAT AGTTGTTTAC 50 

GTTGGATGTG GAGAACGTGG TAACGAGATG ACAGACGTTC TTAACGAATT 100 

CCCAGAACTT AAAGACCCTA AAAC TGGGG A AAGCTTAATG AAGAGAACAG 15 0 

TTCTTATTGC TAATACATCT AACATGCCAG TTGCTGCCAG AGAAGCATCA 2 00 

3 0 AT ATATAC TG GTATAACAAT AGCAGAGTAT TTCAGAGATA TGGGATACTC 250 

AGTATCAATC^TGGCTmC^CAACTTCACG TTGGGCAGAG GCTTTAAGAG 3 00 

AAATGTCAGG AAGACTTGAA G AAATGC C AG GAGACGAAGG TTACCCAGCA 3 50 

TACTTAGGAT CAAGACTTGC TGATT AC TAT GAAAGAGCTG GTAAAGTTGT 400 

AGCTTTAGGT AAAGATGGAA GAGAAGGAGC TGTTACAGCT ATCGGAGCAG 450 

35 TATCCC 45 



2 ) INFORMATION FOR SEQ ID NO: 832 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridixwi tetani 
50 (B) STRAIN: ATCC 19406 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 832 

CCAACTTGCA AAATGGGCTG ATGCTCAAAT AGTTGTGTAC ATAGGATGTG 50 

55 GAGAACGTGG AAATGAAATG ACAGACGTTT TAAATGAGTT CCCAGAATTA 100 

AAGGATCCTA AAACCGGGGA ATCTTTAATG AAAAGAACTG TGTTAATAGC 150 

AAATACATCT AATATGCCTG TTGCAGCTAG AGAAGCATCT AT AT AT AC TG 200 

GTATAACAAT AGGGGAATAT TTTAGAGATA TGGGATATTC AATAGCACTA 250 

ATGGCAGATT CG ACTTC TAG ATGGGCAGAG GCTCT AAGAG AAATGTCTGG 3 00 

60 AAGACTAGAG GAG ATGC C AG GTGAAGAAGG TTATCCAGCT TATTTAGGAT 3 50 
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15 



20 



CTAGATTAGC AGAGTTCTAT GAAAGAGCAG GTAATGTTAT ATGTTTAGGT 4 00 

CAGGATGGAA GAGAAGGAGC ATTAACAGCT ATAGGAGCAG TTTC 444 



2 ) INFORMATION FOR SEQ ID NO: 83 3 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1786 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 833 



TGAACCAAGG AAAAATAATA ACCGTTTCGG GACCTCTTGT TGTGGC TTCT 5 0 

GGGATGCAAG AAGCTAATAT TCAAGATATT TGTCGTGTGG GACATCTTGG 100 

CTTAGTCGGA GAAATTATTG AAATGCGTCG CGATCAAGCG TCTATTCAGG 150 

TTTATGAGGA AACATCAGGG ATCGGTCCAG GAGAACCAGT AGTGAC T AC T 2 00 

2 5 GGTTGTCCTT TGTCGGTCGA GTTAGGCCCG GGCCTGATTT CAGAAATGTT 2 50 

TGACGGTATT CAGCGACCGC TTGATCGTTT TCAAAAAGCA ACGGACAGCG 3 00 

ACTTTTTAAT CCGTGGTGTG GCTATCCCAA GTCTTGATCG AAAGGCTAAG 3 50 

TGGGCATTTA TTCCCAAGCT AAGTGTTGGT CAAGAAGTAG TTGCAGGTGA 4 00 

TATTTTAGGA ACTGTGCAAG AAACAGCTGT CATTGAGCAC CGTATCATGG 45 0 

3 0 TTCC TTATAA AGTTTCAGGG ACCTTGGTGG CTATTCATGC AGGGG AC TTC 500 

ACAGTAACAG ATACAGTTTA TGAAATTAAG CAGGAAGACG GTTCCATTTA 55 0 

CCAAGGTAGC CTCATGCAGA CTTGGCCAGT TCGTCAAAGT CGCCCTGTTG 600 

CTCAAAAGCT TATCCCAGTC GAACCTTTGG TTACAGGTCA ACGGGTTATT 650 

GACACCTTTT TCCCTGTTAC AAAAGGTGGT GCCGCTGCCG TTCCTGGACC 700 

35 ATTTGGGGCA GGAAAAACAG TTGTGCAGCA TCAAATAGCT AAATTTGC CA 750 

ACGTTGATAT TGTTATTTAT GTCGGTTGTG GGGAACGCGG CAACGAGATG 800 

ACCGACGTTT TGAATGAGTT TCCAGAGTTA ATTGACCCAA ATACAGGCCA 850 

GTCCATTATG GAGCGCACGG TGTTAATTGC AAACACCTCT AATATGCCAG 900 

TAGCAGCGCG TGAAGCGTCG ATTTACACAG GTATTACCAT TGCCGAATAT 95 0 

40 TTCCGTGATA TGGGCTATTC TGTGGCTATC ATGGCAGACT CGACATCACG 1000 

TTGGGCAGAA GCTCTGCGCG AGATGTCAGG ACGCCTACAA GAAATGCCTG 1050 

GTGATGAAGG CTACCCGGCT TACTTAGGGA GTCGTATTGC CGAATATTAT 1100 

GAACGGGCTG GTCGTGTTCG G AC C TTGGG A AGTCAAGAAC GTGAGGGAAC 115 0 

CATTACAGCC ATCGGCGCGG TTTCTCCTCC TGGAGGGGAT ATTTCAGAGC 120 0 

45 CTGTCACTCA AAAC AC CCTT CGGATTGTCA AAGTTTTCTG GGGGCTCGAC 1250 

GCGCCTCTTG CGCAACGGCG TCACTTCCCA GCGATTAACT GGCTGACGTC 13 00 

TTATTCATTG TATCAAGATG ATGTAGGAAG C T ATATTG AC CGTAAACAGC 1350 

AATCTAATTG GTCCAACAAG GTAACTCGTG CCATGGCTAT TTTGCAGCGT 1400 

GAAGCCAGTC TAGAAGAAAT TGTACGCTTG GTGGGGCTTG ATTCACTGTC 1450 

50 TGAACAAGAT CGTTTGACCA TGGCTGTTGC CCGGCAAATT CGGGAGGATT 1500 

ATCTCCAGCA AAATGCCTTT GATTCGGTGG ATACCTTTAC TTCCTTTCCG 155 0 

AAACAAGAGG CCATGCTAAC CAATATTTTG AC C TTTAATG AGGAAGCCAG 1600 

CAAAGCCCTT TCTTTGGGAG CTTATTTTAA TGAGATTATG GAAGGCACTG 165 0 

CTCAGGTACG CGATCGCATC GCACGCAGCA AATTTATCCC AGAAGAAAAC 17 00 

55 TT AG AG C AG A TTAAAGGGCT TACTCAGAAG GTTACCAAAG AGATTCACCA 17 5 0 

CGTTTTAGCA AAGGGAGGAA TTTAGATGAG CGTTCT 1786 



60 2 ) INFORMATION FOR SEQ ID NO : 834 
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(l , SEQUENCE ^^STSS 081 
< A) Sg? B iailei« acid 
lB l SSnDEDNBSS: Double 
g S^GY: linear 

„ TV p E - Genomic DNA 
MOLECULE TYPE. 

< B) cv.O ID 




30 



35 



QPO id NO: 835 
2 ) INFORMATION FOB. SEQ 

-rm.E TYPE: Genomic DNA 
MOliECULE T»r 



50 
100 
150 
200 
250 
300 
350 
400 
450 
499 



50 



55 




40 SM » *° S °~ 50 

100 
150 
200 

45 ^I^TTAT ACTATGO™ ~^ TA TGC CTO£-££ A TATGGGA^ «J 

350 
400 
450 
464 
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C170 id NO: 836 
INFORMATION FOR SEQ ID 




(oX TYPE'- nucleic £g£ le 

1b TYPE- Genome DNA 
MOLECULE Tx 

» cnURCE: . • - 




50 
100 
150 
200 
250 
300 
350 
400 
446 



25 



30 



( D > „; r DNA 

« rrvpE: Genomic 
MOLECULE TYPE- 

cmiRCB : . = 




50 
100 
150 
200 
250 
300 
350 
400 
450 
456 



55 



fin 



... SEQUENCE CHARft baS es 
(1 ' LENGTH: aC id 

(D) TOPOLOGY • € 
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(ii) 



MOLECULE TYPE- Genomic DNA 

(B) STRAIN: MOU-2 
(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 838 



20 



(xi) SEQUrJN<-c 

_„ c„»K» "™ag SSSSS SSSot 

Hiss sssss KgE sggjg ssss 
IS is g| USE S| 

2) INFORMATION FOR SEQ ID NO: 839 



m SEQUENCE CHARACTERISTICS. 
U) ( A ) LENGTH : 437 bases 
25 2' TYPE: Nuclexc acid 

r STRANDEDNESS : Double 
(D) TOPOLOGY: Linear 



j0 (ii) MOLECULE TYPE: Genomic DNA 

< vi) STi- * Trypanosoma brucei 
(B) STRAIN: EATRO 795 
35 (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 839 

GCACTCTCGA AGTACTCGAA CAGTGACGCT GTTATTTACG TGGGTTGTGG 
TGAGCGTGGC AATGAGATGG CGGAGGTGCT CATGGACT^ caCATG cCTG 
„n PP AC C ATT AT TGATGGACGT GAGGAGTCTA ^ GTGAGG C ATCTATTTA 

40 ?SSgcaaata cttccaatat gcctgttgct gctcgt^ ^cacatcG 

S^CGTATC ACCTTAGCTG AGTATTATCG GGATATC*^ GCGTGAGATT 

SSgatogc cgattcaacc tcccgctggg ctgaggcttt g CGCGTACCT 
5cSgtcgtc ttgctgaaat gcctgcagat ggaggtT cgt gtgacatcca 

45 t Stgggc? S3SSSSS GGGTCTGTAA caattgt 



50 



2) INFORMATION FOR SEQ ID NO: 840 

(i) SEQUENCE CHARACTERISTICS: 
U) (A) LENGTH: 1052 bases 
(B) TYPE: Nucleic acid 
c STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

60 (vi ) ORIGINAL SOURCE: ^ 



50 
100 
150 
200 
250 
300 
350 
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(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: MM3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 84 0 

5 

TGAAGGCTGA GCGCGAGCGC GGCATCACGA TCGACATCGC GCTCTGGAAG 50 

TTCGAGTCGC CCAAGTCTGT GTTCACGATC ATCGACGCCC CCGGCCACCG 100 

CGACTTCATC AAGAACATGA TCACGGGCAC GTCTCAGGCG GACGCCGCCG 150 

TCCTTGTCAT TGCGTCATCG CAGGGTGAGT TTGAGGCGGG CATCTCGAAG 2 00 

10 GACGGCCAGA CACGCGAGCA CGCGCTGCTC GCCTTCACGC TCGGCGTGAA 2 50 

GCAGATGGTT GTGTGCTGCA ACAAGATGGA CGACAAGTCG GTGAACTTCG 3 00 

CCCAGGAGCG CTACGATGAG ATTGTGAAGG AGGTGTCGGC GTACC TGAAG 3 50 

AAGGTTGGGT ACAACGTGGA GAAGGTGCGC TTCATCCCCA TCTCCGGCTG 400 

GCAGGGCGAC AACATGATTG ACAAGTCGGA AAATATGCCG TGGTACAAGG 450 

15 GCCCCACGCT GCTGGAGGCA CTCGACATGC TGGAGCCCCC GGTGCGCCCC 5 00 

AGCGACAAGC CGCTGCGCCT GCCGCTGCAG GACGTGTACA AGATCGGCGG 55 0 

TATCGGCACC GTGCCGGTCG GTCGCGTGGA GACGGGCACG ATGAAGCCCG 600 

GCGACGTGGT GACGTTTGCG CCCGCCAACG TGACGACGGA GGTGAAGTCG 650 

ATTGAGATGC ACCACGAGCA GCTGGCCGAG GCCACGCCCG GCGACAACGT 7 00 

2 0 CGGCTTCAAC GTGAAGAACG TGTCCGTGAA GGACATCCGC CGTGGCAACG 7 50 

TGTGCGGCAA CTCGAAGAAC GACCCCCCAA AGGAGGCGGC CGACTTCACG 8 00 

GCGCAGGTGA TCATCCTGAA CCACCCCGGC CAGATCGGCA ACGGCTATGC 850 

GCCGGTGCTC GACTGCCACA CCTGCCACAT CGCGTGCAAG TTCGCCGAGA 900 

TCGAGTCCAA GATCGACCGC CGCTCCGGCA AGGAGCTTGA GAAGAACCCC 950 

2 5 AAGTCGATCA AGTC CGGTG A CGCCGCCATG GTGCGCATGG TGCCGCAGAA 1000 

GCCCATGTGC GTGGAGGTGT TCAACGACTA CGCTCCTCTT GGCCGCTTTG 1050 

CC 1052 
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2 ) INFORMATION FOR SEQ ID NO: 841 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1061 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: CGL-1 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 841 

TGAAGGCTGA GCGCGAGCGC GGCATCACGA TCGACATCGC GCTCTGGAAG 50 

TTCGAGTCGC CCAAGTCTGT GTTCACGATC ATCGACGCCC CCGGCCACCG 100 

CGACTTCATC AAGAACATGA TCACGGGCAC GTCTCAGGCG GATGCCGCCG 150 

50 TCCTTGTCAT TGCGTCATCG CAGGGTGAGT TTGAGGCGGG CATCTCGAAG 2 00 

GACGGCCAGA CACGCGAGCA CGCGCTGCTC GCCTTCACGC TCGGCGTGAA 2 50 

GCAGATGGTT GTGTGCTGCA ACAAGATGGA CGACAAGTCG GTGAACTTTG 3 00 

CCCAGGAGCG CTACGATGAG ATTGTGAAGG AGGTGTCGGC GTACCTGAAG 3 50 

AAGGTTGGGT ACAACGTGGA GAAGGTGCGC TTCATCCCCA TCTCCGGCTG 400 

55 GCAGGGCGAC AACATGATTG ACAAGTCGGA AAATATGCCG TGGTACAAGG 450 

GCCCCACGCT GCTGGAGGCA CTCGACATGC TGGAGCCCCC GGTGCGCCCC 500 

AGCGACAAGC CGCTGCGCCT GCCGCTGCAG GATGTGTACA AGATCGGCGG 550 

TATCGGCACC GTGCCGGTCG GTCGCGTGGA GACGGGCACG ATGAAGCCCG 600 

GCGACGTGGT GACGTTTGCG CCCGCCAACG TGACGACGGA GGTGAAGTCG 650 

60 ATTGAGATGC ACCACGAGCA GCTGGCCGAG GCCACGCCCG GCGACAACGT 700 
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CGGCTTCAAC 
TGTGCGGCAA 
GCGCAGGTGA 
GCCGGTGCTT 
5 TCGAGTCCAA 
AAGTCGATCA 
GCCCATGTGC 
CCGTGCGTGA 



GTGAAGAACG 
CTCGAAGAAC 
TCATCCTGAA 
GACTGCCACA 
GATCGACCGC 
AGTCCGGTGA 
GTGGAGGTGT 
C 



TGTCCGTGAA 
GATCCCCCAA 
CCACCCCGGC 
CCTGCCACAT 
CGCTCCGGCA 
CGCCGCCATG 
TCAACGACTA 



GGACATCCGC 
AGGAGGCGGC 
CAGATCGGCA 
CGCGTGCAAG 
AGGAGCTTGA 
GTGCGCATGG 
CGCTCCTCTT 



C GTGGC AACG 7 50 

CGACTTCACG 800 

ACGGCTATGC 850 

TTCGCCGAGA 9 00 

GAAGAACCCC 9 50 

TGCCGCAGAA 1000 

GGCCGCTTTG 1050 
1061 



10 



15 



20 



25 



2 ) INFORMATION FOR SEQ ID NO: 842 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: PCU-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 842 



GCTGAAGGCT 
AGTTCGAGTC 
30 . CGCGACTTCA 
CGTCCTTGTC 
AGGACGGCCA 
AAGC AG ATGG 
CGCCCAGGAG 
3 5 AGAAGGTTGG 
TGGCAGGGCG 
GGGCCCCACG 
CCAGCGACAA 
GGTATCGGCA 
40 CGGCGACGTG 
CGATTGAGAT 
GTCGGCTTCA 
CGTGTGCGGC 
CGGCGCAGGT 
45 GCGCCGGTGC 
GATCGAGTCC 
CCAAGTCGAT 
AAGCCCATGT 
TGCCGTGCGT 

50 



GAGCGCGAGC 

GCCCAAGTCT 

TCAAGAACAT 

ATTGCGTCAT 

GACACGCGAG 

TTGTGTGCTG 

CGCTACGATG 

GTACAACGTG 

ACAACATGAT 

CTGCTGGAGG 

GCCGCTGCGC 

CCGTGCCGGT 

GTGACGTTTG 

GCACCACGAG 

ACGTGAAGAA 

AACTCGAAGA 

GATCATCCTG 

TCGACTGCCA 

AAGATCGACC 

CAAGTCCGGT 

GCGTGGAGGT 

GA 



GCGGCATCAC 

GTGTTCACGA 

GATCACGGGC 

CGCAGGGTGA 

CACGCGCTGC 

CAACAAGATG 

AGATTGTGAA 

GAGAAGGTGC 

TGACAAGTCG 

CACTCGACAT 

CTGCCGCTGC 

CGGTCGCGTG 

CGCCCGCCAA 

CAGCTGGCCG 

CGTGTCCGTG 

ACGACCCCCC 

AACCACCCCG 

CACCTGCCAC 

GCCGCTCCGG 

GACGCCGCCA 

GTTCAACGAC 



GATCGACATC 

TCATCGACGC 

ACGTCTCAGG 

GTTTGAGGCG 

TCGCCTTCAC 

GACGACAAGT 

GGAGGTGTCG 

GCTTCATCCC 

GAAAATATGC 

GCTGGAGCCC 

AGGACGTGTA 

GAGACGGGCA 

CGTGACGACG 

AGGCCACGCC 

AAGGACATCC 

AAAGGAGGCG 

GCCAGATCGG 

ATCGCGTGCA 

C AAGG AGC TT 

TGGTGCGCAT 

TACGCTCCTC 



GCGCTCTGGA 50 

CCCCGGCCAC 100 

CGGACGCCGC 150 

GGCATCTCGA 200 

GCTCGGCGTG 250 

CGGTGAACTT 300 

GCGTACCTGA 3 50 

CATCTCCGGC 400 

CGTGGTAC AA 450 

CCGGTGCGCC 500 

CAAGATCGGC 550 

CGATGAAGCC 600 

GAGGTGAAGT 650 

CGGCGACAAC 7 00 

GCCGTGGCAA 7 50 

GCCGACTTCA 800 

CAACGGCTAT 850 

AGTTCGCCGA 900 

GAGAAGAACC 950 

GGTGCCGCAG 1000 

TTGGCCGCTT 1050 
1062 



2 ) INFORMATION FOR SEQ ID NO: 843 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1057 bases 

(B) TYPE: Nucleic acid 

< C ) STRANDEDNESS : Double 
(D) TOPOLOGY : Linear 



674 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bovis 

(B) STRAIN: Suarez-3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 843 



TACATTGACA 
10 AGTACACTCC 
GGTATTACTA 
TTACGGTCAC 
TATCTGGCGC 
GACGGTCCCA 
15 TGGTGT AC C T 
ACTCTGAGCT 
GAGTTTGGTT 
AAAGGCATTG 
TATTGGATGC 

2 0 ATGCCGCTCT 

TACTGTTGTA 
CTATTGAGGT 
GGTCTTRARA 
GATTGGTGTT 
25 TTGTATTGAT 
GACTTGTACG 
GTCTAACTAC 
CAGTTCATTT 
CGTTGCAAGA 

3 0 ATTTGCG 



GCTGCATTGA 
TTATGAAGCA 
TAAATTCGAC 
GTGGACTGTC 
TGCTCAGATG 
TGCCTCAGAC 
CGTTTAGTTG 
ATTGGAGTTG 
ACGACGGTGA 
AACAGTAGTT 
GTGTGATGCC 
TGGTTGCTAT 
ACCGGTAGGA 
TTGCGCCGGT 
TGTTCCGCAA 
TTGCTCAAGG 
TCAACCCGGA 
TGTTGACTAC 
CGTCCTCAGG 
GGATRAGGGT 
TCAAGTTACT 



CCAAGGTCTG 
ATTGACCGTG 
ACATGTTGAA 
CAGGTCACTC 
GATGGTGCCA 
TAAGGAGCAC 
TGTTTTTGAA 
GTGGAGTTAG 
CAACACGCCT 
CCGAGGCTGA 
TTTTTACTGA 
TGACGATGTT 
TAGAGCAGGG 
CCGAAGTCCG 
GAGTCTCAGT 
GTGTGAAGCG 
AGTTACAAAT 
GRAGGAAGGT 
CGTTTATACG 
GTTGAGATGG 
TTATCCCATG 



TTCGATGGGT 
CTCCTGAGGA 
TATGAGACTA 
TGATTATGTG 
TATTGGTTGT 
GTGTTGCTTG 
CAAGCTTGAC 
AGGTTCGTGA 
ATCGTTCGTG 
CATTAAGCCA 
CTCCAGAACG 
CTTGC C ATTC 
CAAGATTCGT 
GCAAGAAGAC 
RAGGGTATTG 
CGACRAGGTA 
GTCACGGTGA 
GGGCGCAAGC 
TACTGGAGAC 
CAGCTCCTGG 
CCTGTCCATG 



GGTCATGGCG 5 0 

GCGTAAACGT 100 

AGAACCGTCA 15 0 

AAGAACATGA 200 

TTCTTGTGTT 250 

CTAAGCAGAT 300 

ATGTTAGAGG 350 

GTTATTGAGT 4Q0 

GCAGTGCTAT 450 

ATTCAGGATT 500 

TAAGGATGAC 550 

CTGGCAAGGG 600 

TGTGGTGACC 650 

TGTGTGTGTT 7 00 

CTGGTGACCA 75 0 

GAGCGCGGGT 800 

ATTTGATGCT 850 

ATCCGTTTGT 900 

GTTTGCTGCT 95 0 

TGACAACGTG 1000 

AAGGTTTACG 1050 
1057 



2) INFORMATION FOR SEQ ID NO: 844 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 943 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
40 ( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Leishmania aethiopica 

<B) STRAIN: ATCC 50119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 844 

50 CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 50 

GCGAGCACCT CCTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

TTCATCAACA AAGTGGATAT GACGGACGAG GACACGTGCG AC C TGGTGG A 150 

CATGGAGGTG CGCGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAGGAGA 2 00 

CACCAATCGT GCGCGGCTCG GCCCTCAAGG CCGTCGAGGG CGACGCGAAG 250 

55 TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AGTGGATCCC 3 00 

CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATCGAGCACG 3 50 

TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 400 

GATCAGGGCG TGCTGAAGCT CAACACAGAC GCCGAGCTGG CCGGCTTCAG 45 0 

CGC CAAGAAG TCGACGGTGA GGGTGACGGG CATCGAGATG TACCACAAGA 500 

60 CGCTGAGCGA GTGCATGCCC GGTGACTCCG TCGGCGTCAG CATTGTCGGC 55 0 

675 
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10 



RrrrrrGACA CAACCAGTCT GTCCAAGGAC AACGTGGAAC GCGGCATGGT 600 

aaSSggcg ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 650 

to?Sg?Sc? SSSXSXS GAGGGCGGCC gccacaccgg CTTCAGCCCC 700 

CACTACCGCC CGCAGCTCTT CTTCCATTGC GCTGACGTGA CAGCGGACAT 750 

SSmCCG GAGGCGGAGA AGCACCGGGA GGAGCTGAAC AAGAAATTCG 800 

rPrrCGGCCC CGAGGAGGAC AAGAAGAAAG AGGCGGAGAT GAAGGAGTTC 850 

gagaSgc totctgcat gccgggcgat aaccgcgagc TGATCCTGAC 900 

GCTCGCGTAC CCGATGCCCA TTGAAAAGGG CCTGAAGTTC ACC 943 



2 ) INFORMATION FOR SEQ ID NO: 845 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 939 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania amazonensis 

(B) STRAIN: ATCC 50131 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 845 

CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 50 

GCGAGCATCT CTTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

30 TTCATCAACA AAGTGGACAT GACGGACGAG GACACGTGCG ACCTGGTGGA 150 

~~ CATGGAGGTG CGTGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAAGAGA 2 00 

CGCCCATCGT GCGCGGCTCG GCCCTCAAAG CCGTCGAGGG CGACGCGAAG 250 

TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AATGGATCCC 300 

CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATTGAGCACG 350 

35 TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 400 

GATCAGGGCG TGCTGAAGCT CAACACAGAC GCCGAGCTGG CCGGCTTCAG 450 

CGCCAAGAAG TCGAC GGTG A GGGTGACGGG CATC GAG ATG TACCACAAGA 500 

CGCTGAGTGA GTGCATGCCC GGTG AC TC C G TCGGCGTCAG CATTGTCGGC 550 

ACCGGCGACA CGATCAGTCT CTCCAAGGAC AACGTTGAAC GCGGCATGGT 600 

40 AATGGCGGCA ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 650 

TGTACGTGCT GACGAAGGAT GAGGGCGGCC GCCACACTGG CTTCAGCCCC 700 

CACTACCGCC CGCAGCTCTT CTTCCATTGT GCTGACGTGA CGGCGGACAT 7 50 

GAGCTTCCCG GAGGCGGAGA AGCACCGCGA GGAGCTCAAC AAGAAATTCG 800 

GCCGCGGCCC CGAGGAGGAC AAGCAGAAGG AGGCGGAGAT GAAAGAGTTC 850 

45 GAGAGCAAGC TCGTCTGCAT GCCGGGCGAC AACCGCGAGC TGATCCTGAC 900 

GCTGGCGTAC CCGATGCCCA TTGACAAGGG TCTGAAGTT 93 9 



50 2 ) INFORMATION FOR SEQ ID NO: 846 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 5 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



60 



(vi) ORIGINAL SOURCE: 
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CA 02307010 2000-05-19 



(A) ORGANISM: Leishmania donovani 

(B) STRAIN: ATCC 50212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 846 

5 

CATTGTGGTG GCGGCCACCG ACGGCGTCAT GCCGCAGACA CGCGAGCACC 50 

TCCTGATCTG CTCGCAGATC GGGCTTCCGG CGCTCGTAGG GTTCATCAAC 100 

AAGGTGGACA TGACGGACGA GG AC AC GTGC GACCTGGTGG ACATGGAGCT 15 0 

GCGCGAGCAG CTGGAGAAAT ACAAGTTTCC GGCGGAGGAG ACGCCAATCG 2 00 

10 TGCGCGGCTC AGCCCTCAAA GCCGTCGAGG GCGATGCGAA GTACGAGGAG 2 50 

AACATTCTCG AACTGGTGCG GAAGTGTGAC GAGTGGATCC CTGACCCGCC 3 00 

GCGCAACACA GACAAGCCTT TCCTTATGGC CATCGAGCAC GTTTACGAGA 350 

TCGGCAAGGA CAAGAAGAGC GTTGTCGTGA CCGGCCGCGT CGATCAGGGC 400 

ATTCTGAAGC TCAACACAGA CGCCGAGCTG GCCGGCTTCA GCTCCAAGAA 450 

15 GTCGACGGTG AGGGTGACGG GCATCGAGAT GTACCACAAG AC GC TG AG CG 500 

AGTGCATGCC TGGTGACTCC GTCGGCGTCA GCATTGTCGG GACCGGCGAC 55 0 

ACGACCAGTC TATCCAAGGG CAACGTGGAA CGCGGCATGG TGATGGCGGC 600 

GACGGGTAGC ACGAACCTGT ACAACAAGGT GAAGGCGCAG GTGTACGTGC 65 0 

TG AC G AAGGA TGAGGGCGGC CGCCACACTG GCTTTAGTCC TCACTACCGC 700 

20 CCGCAGCTCT TCTTCCATTG TGCTGACGTG ACGGCGGACA TGAGCTTCCC 75 0 

GGAGGCGGAG AAGCACCGCG AAGAGCTCAA CAAGAAATTC GGCCGCGGCC 800 

CCGAGGAGGA CAAGAAGAAA GAGGCAGCGA TGAAGGAGTT CGAGAGCAAG 85 0 

CTCGTCTGCA TGCCGGGCGA TAACCGCGAG CTGATCCTGA CGCTGGCGTA 900 

CCCGATGCCC ATTGAAAAGG GTCTGAAGTT CACCATCCGT GAGGG 945 

25 

2) INFORMATION FOR SEQ ID NO: 847 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 939 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania Infantum 
40 (B) STRAIN: MOU 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 847 

GGCATCATTG TGGTGGCGGC CACCGACGGC GTCATGCCGC AGACACGCGA 50 

45 GCACCTCCTG ATCTGCTCGC AGATCGGGCT TCCGGCGCTC GTAGGGTTCA 100 

TCAACAAGGT GGACATGACG GACGAGGACA CGTGCGACCT GGTGGACATG 150 

GAGCTGCGCG AGCAGCTGGA GAAATACAAG TTTCCGGCGG AGGAGACGCC 2 00 

AATCGTGCGC GGCTCAGCCC TCAAAGCCGT CGAGGGCGAT GCGAAGTACG 250 

AGGAGAACAT TCTCGAACTG GTGCGGAAGT GTGACGAGTG GATCCCTGAC 300 

50 CCGCCGCGCA ACACAGACAA GCCTTTCCTT ATGGCCATCG AGCACGTTTA 350 

CGAGATCGGC AAGGACAAGA AGAGCGTTGT CGTGACCGGC CGCGTCGATC 400 

AGGGCGTTCT GAAGCTCAAC ACAGACGCCG AGCTGGCCGG CTTCAGCTCC 450 

AAGAAGTCGA CGGTGAGGGT GACGGGCATC GAGATGTACC ACAAGACGCT 500 

GAGCGAGTGC ATGCCTGGTG ACTCCGTCGG CGTCAGCATT GTCGGCACCG 550 

55 GCGACACGAC CAGTCTATCC AAGGGCAACG TGGAACGCGG CATGGTGATG 600 

GCGGCGACGG GTAGCACGAA CCTGTACAAC AAGGTGAAGG CGCAGGTGTA 650 

CGTGC TGACG AAGGATGAGG GCGGCCGCCA CACTGGCTTT AGTCCTCACT 700 

ACCGCCCGCA GCTCTTCTTC CATTGTGCTG ACGTGACGGC GGACATGAGC 750 

TTCCC GGAGG CGGAGAAGCA CCGCGAAGAG CTCAACAAGA AATTCGGCCG 800 

60 CGGCCCCGAG GAGGACAAGA AGAAAGAGGC AGCGATGAAG GAGTTCGAGA 85 0 
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GCAAGCTCGT CTGCATGCCG GGCGATAACC GCGAGCTGAT CCTGACGCTG 
GCGTACCCGA TGCCCATTGA AAAGGGTCTG AAGTTCACC 



900 
939 



10 



15 



2 ) INFORMATION FOR SEQ ID NO: 848 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 933 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania enriettn 

(B) STRAIN: ATCC 50120 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 848 

CACCGACGGC GTCATGCCGC AGACACGGGA GCACCTGCTC ATCTGCTCGC 50 

AGATCGGGCT GCCGGCGCTT GTAGGGTTCA TCAATAAAGT TGACATGACG 100 

GACGAGGATA CGTGCGACCT CGTGGACATG GAGGTGCGGG AACAGCTGGA 150 

25 GAAGTACAAG TTTCCGGCCG AGGAGACGCC CATCGTACGT GGCTCGGCCC 200 

TCAAGGCCCT CGAGGGGGAT GCGCAATACG AGGGGAGTAT TCTCGAGCTG 250 

GTGCGAAAGT GCGACGAGTG GATCCCCGAC CCGCCGCGCA ACACCGAAAA 300 

GCCTTTCCTC ATGGCTATCG AGCACGTTTA CGAGCTCGGC AAAGACAAGA 350 

AGAGCGTCAT CGTTACCGGC CGCGTCGATC AAGGTGTGCT GAAGCTCAAC 400 

30 ACAGACGCCG AGCTGGCCGG CTTCAGCGCC AAGAAGGCGA CAGTCAAAGT 450 

Sacg3§ca5c GAGATGTATC -ACAAGAGACT GAATGAGTGC ATGCCCGGCG 500 

ACTCTGTCGG TGTCAGCATC GTCGGTACCG GTG AC ACG AC CAGCTTATCC 550 

AAGGATAATG TTGAGCGCGG TATGGTAATG GCGGCAACGG GTAGCACGAA 600 

CCTGTACAAC AAGCTGAAGG CGCAGGTTTA CGTGCTGACA AAGGAGGAGG 650 

35 SggSScCA SaCCGGGTTC AGCCCCCACT ACCGCCCGCA GCTCTTCTTC 700 

CACTGCGCTG ACGTG AC CGC AGACATGAGC TTCCCGGAGG CGGAGAAGTA 750 

CCGCGAGGAG CTCAACAAGA AGTTCGGCCG TGGCCCTGAG GAGGACAAGA 800 

AGAAAGAGGC GGAGATGAAG GAGTTCGAAA GCAAACTTGT CTGCATGCCA 850 

GGCGATAACC GCGAGCTGAT CCTAACTCTG GCGTACCCGA TGCCCATCGA 900 

CAAGGGCCTG AAGTTCACCA TCCGTGAGGG CGG 933 



40 



45 



2 ) INFORMATION FOR SEQ ID NO: 849 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 943 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNE S S : Doub 1 e 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: LeisJimania gerbilli 

(B) STRAIN: ATCC 50121 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 849 

60 CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 5 0 
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GCGAGCACCT CCTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

TTCATCAACA AAGTGGACAT GACGGACGAG GACACGTGCG ACCTGGTGGA 15 0 

CATGGAGGTG CGCGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAGGAGA 200 

CACCAATCGT GCGCGGCTCG GCCCTCAAGG CCGTCGAGGG CGACGCGAAG 25 0 

5 TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AGTGGATCCC 3 00 

CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATCGAGCACG 350 

TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 400 

GATCAGGGCG TGCTGAAGCT CAACACGGAC GCCGAGCTGG CCGGCTTCAG 450 

CGCCAAGAAG TCGACGGTGA GGGTGACGGG CATTGAGATG TACCACAAGA 500 

10 CGCTGAGCGA GTGCATGCCC GGTGACTCCG TCGGCGTCAG CATTGTCGGC 55 0 

ACCGGCGACA CGACCAGTCT GTCCAAGGAC AACGTGGAAC GCGGCATGGT 600 

AATGGCGGCG ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 650 

TGTACGTGCT GACGAAGGAT GAGGGCGGCC GCCACACTGG CTTCAGCCCC 700 

CACTACCGCC CGCAGCTCTT CTTCCATTGC GCTGACGTGA CAGCGGACAT 75 0 

15 GAGCTTCCCG GAGGCGGAGA AGCACCGCGA GGAGCTCAAC AAGAAATTCG 800 

GCCGCGGCCC CGAGGAGGAC AAGAAGAAAG AGGCGGAGAT GAAGGAGTTC 850 

GAGAGCAAGC TCGTCTGCAT GCCGGGCGAT AACCGCGAGC TGATCCTGAC 900 

GCTGGCGTAC CCGATGCCCA TTGAAAAGGG TCTGAAGTTC ACC 943 



20 



35 



2 ) INFORMATION FOR SEQ ID NO: 850 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 918 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

3 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania major 

(B) STRAIN: ATCC 50122 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 850 



GGCATCATTG TGGTGGCGGC CACCGACGGC GTCATGCCGC AGACACGCGA 50 

ACACCTCCTG ATCTGCTCGC AAATTGGCCT TCCGGCGCTC GTAGGGTTCA 100 

40 TCAACAAAGT GGACATGACG GACGAGGACA CGTGTGACCT GGTGGACATG 150 

GAGGTGCGCG AGCAGCTGGA GAAATACAAG TTTCCGGCGG AGGAGACACC 200 

AATCGTGCGC GGCTCGGCCC TCAAGGCCGT CGAGGGCGAC GCGAAGTACG 250 

AGGAGAACAT CCTCGAACTG GTGCGGAAGT GCGACGAGTG GATCCCCGAC 3 00 

CCGCCGCGCA ACACAGACAA GCCTTTCCTT ATGGCCATCG AGCACGTTTA 350 

45 CGAGATCGGC AAGGACAAGA AGAGCGTCAT CGTGACCGGC CGCGTCGATC 400 

AGGGCGTGCT GAAGCTCAAC ACAGACGCCG AGCTGGCCGG CTTCAGCGCC 450 

AAGAAGTCGA CGGTGAGGGT GACGGGCATT GAAATGTACC ACAAGACGCT 500 

GAGCGAGTGC ATGCCCGGTG ACTCCGTCGG CGTCAGCATT GTCGGCACCG 550 

GCGACACGAC CAGTCTGTCC AAGGACAACG TGGAGCGCGG CATGGTAATG 600 

50 GCGGCGACGG GTAGCACGAA CCTGTACAAC AAGGTGAAGG CGCAGGTGTA 650 

CGTGCTGACG AAGGATGAGG GCGGCCGCCA CACTGGCTTC AGCCCCCACT 700 

ACCGCCCGCA GCTCTTCTTC CATTGCGCTG ACGTGACAGC GGACATGAGC 750 
TTCCCGGAGG CGGAGAAGCA CCGCGAGGAG CTCAACAAGA AATTCGGCCG . 800 

CGGCCCCGAG GAGGACAAGA AGAAAGAGGC GGAGATGAAG GAGTTC GAGA 850 

55 GCAAGCTCGT CTGCATGCCG GGCGATAACC GCGAGCTGAT CCTGACGCTG 900 

GCGTACCCGA TGCCCATT 918 



60 2 ) INFORMATION FOR SEQ ID NO: 851 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 939 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania mexicana 

(B) STRAIN: ATCC 50156 



15 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 851 



(ii) MOLECULE TYPE: Genomic DNA 



CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 50 

GCGAGCATCT CCTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

TTCATCAACA AAGTGGACAT GACGGACGAG GACACGTGCG ACCTGGTGGA 150 

CATGGAGGTG CGTGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAAGAGA 200 

20 CGCCCATCGT GCGCGGCTCG GCCCTCAAGG CCGTCGAGGG CGACGCGAAG 250 

TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AATGGATCCC 300 

CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATTGAGCACG 3 50 

TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 400 

GATCAGGGCG TGCTGAAGCT CAACACAGAC GCCGAGCTGG CCGGCTTCAG 450 

25 CGTCAAGAAG TCGACGGTGA GGGTGACGGG CATCGAGATG TACCACAAGA 5 00 

CGC TGAGTGA GTGCATGCCC GGTGACTCCG TCGGCGTCAG CATTGTCGGC 550 

ACCGGCGACA CGATCAGTCT CTCCAAGGAC AACGTTGAAC GCGGCATGGT 600 

AATGGCGGCA ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 650 

TGTACGTGCT GACGAAGGAT GAGGGCGGCC GCCACACTGG CTTCAGCCCC 7 00 

3 0 CACTACCGCC CGCAGCTCTT CTTCCATTGT GCTGACGTGA CGGCGGACAT 750 

GAGCTTCCCG GAGGCGGAGA AGCACCGCGA GGAGCTCAAC AAGAAATTCG 800 

GCCGCGGCCC CGAGGAGGAC AAGCAGAAGG AGGC GG AG AT GAAAGAGTTC 850 

GAGAGCAAGC TCGTCTGCAT GCCGGGCGAC AACCGCGAGC TGATCCTGAC 9 00 

GCTGGCGTAC CCGATGCCCA TTGAGAAGGG TCTGAAGTT 93 9 

35 

2 ) INFORMATION FOR SEQ ID NO: 852 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 912 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tarentolae 
50 (B) STRAIN: MOU-2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 852 

TCATTGTGGT GGCCGCCACC GACGGCGTCA TGCCGCAAAC ACGGG AG C AC 50 

55 CTTTTGATCT GCTCGCAGAT CGGGCTGCCG GCGCTCGTAG GGTTCATCAA 100 

CAAAGTGGAC ATGACAGACG AAGACACGTG CGACCTGGTA GACCTGGAGG 150 

TGCGTGAGCA GCTGGAGAAG TACAAGTTTC CGGCAGAGGA AACACCAATC 200 

GTGCGTGGCT CGGCCCTCAA GGCCGTTGAG GGCGATGCAA AGTACGAGGA 250 

GAACATCCTC GAACTGGTGC GGAAGTGCGA CGAGTGGATC CCAGACCCGC 300 

60 CACGCAATAC GGACAAGCCT TTCCTTATGG CCATTGAACA CGTGTACGAG 350 

680 
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ATCGGCAAGG ATAGGAAAAG 
TGTGCTGAAG CTGAACACAG 
AGTCGACGGT GAAAGTGACG 
GAGTGCATGC CCGGCGACTC 
5 CACGACCAGC CTCTCTAAGG 
CTACGGGTAG CACGAACCTG 
CTCACGAAGG ATGAGGGCGG 
TCCGCAGCTC TTCTTCCACT 
CGGAGGCGGA GAAGCACCGC 
10 CCCGAGGAGG ACAAGAAAAA 
GCTGGTCTGC ATGCCAGGCG 
ACCCGATGCC TA 



CGTCATCGTA ACCGGCCGCG 
ACGCCGAGCT GGCCGGCTTC 
GGCATTGAGA TGTACCACAA 
TGTCGGCGTC AGCATTGTGG 
ACAATGTTGA GCGTGGCATG 
TACAACAAAG TAAAGGCGCA 
CCGCCACACC GGCTTCAGCC 
GCGCTGACGT AACGGCGGAC 
GAGGAACTCA ATAAGAAATT 
GGAGGCGGAG ATGAAGGAGT 
ATAACCGCGA GCTGATCCTG 



TCGATCAAGG 4 00 

AGCGCCAAGA 450 

GACACTGACA 5 00 

GCACTGGYGA 550 

GTACTGGCCG 600 

GGTGTATGTA 650 

CCCACTACCG 7 00 

ATGAGCTTCC 7 50 

CGGCCGCGGC 800 

TCGAGAGCAA 850 

ACATTGGCGT 9 00 
912 



15 



25 



2 ) INFORMATION FOR SEQ ID NO : 85 3 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 93 6 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: MM3 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 853 

ATTCTTGTGG TGGCAGCTAA CGACGGATGC ATGCCGCAGA CGCGTGAGCA 50 

CCTGCTTATT TGTTCGCAGA TTGGCCTTCC TGCTCTTGTA TGCTTTATCA 100 

ATAAGTGTGA CATGATGCAA GGGCAGGAGG AAATGATTGA ACTTGTTGAA 150 

35 ATGGAGGTAC GTGAACTTTT GGAGAAGTAC AAGTTCCCTG CGGAGGAGAC 2 00 

GCCATTTGTG CGGGGGTCTG CGGTGAAGGC ATTGGAGGGT GATGCTGAAA 2 50 

ATGAAGGAAA GATTTTGGAG CTTGTAAAAA AATGTGATGA ATGGATTCCC 3 00 

GACCCACCGC GTGCCATTGA AAAACCGTTC CTTATGGCCA TTGAGCACGT 3 50 

TTTTGAGGTT GGAAAGGATA AGAAGGCCGT TGTTGTGAGC GGGCGTGTGG 400 

40 ACCAGGGGCA GTTGAAGGTC GGCGCAGATG CAGAACTTTC CGGGTTTAGC 45 0 

GCAAAGAAGC TGACGGTGAA GGTTGCTAGC ATCGAAATGT ACCATAAAAT 500 

TCTGGAGGAT TGCATGCCTG GTGACTCTGT TGGCGCGAAG ATCGTTGGCA 550 

GCGGTGAAAC AGTGAACCTG TCGAAGGAAA ATGTGGAACG CGGCATGGTA 600 

CTCTCCGCAC CAGGTGCAAC GAC AC TGTTC AACAAGGTCC GCGCGCAGGT 650 

4 5 GTACGTGTTG ACAAAGGAAG AAGGCGGTCG TCACACAGCC TTTAGTCCTC 7 00 

ACTATCGTCC GCAGCTTTTC TTCCACTGTG CTGATGTCAC GGCAGATATT 75 0 

AACTTCCCGG AAAGCGAGAA GCTTGCAGGG GAGCTGAACA AAAAGTATGG 800 

CCGTGATGCG GCGGAACAGA AGAAGAAGGA GGCAGAACTG AAAGAGTTTG 85 0 

AAAAGACGCT TGTCTGCATG CCTGGTGATA ACCGCGAACT CCTGCTCACC 900 

50 CTTGCCTATC CAATGCCAAT GGAAAAGGGA CTCAAG 93 6 



55 



2 ) INFORMATION FOR SEQ ID NO: 854 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 934 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Doubl e 
60 (D) TOPOLOGY: Linear 

681 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 
5 (A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: PCU-1 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 854 

10 CGGCATTCTT GTGGTGGCAG CTAACGACGG ATGCATGCCG CAGACGCGTG 50 

AGCACCTGCT TATTTGTTCG CAGATTGGCC TTCCTGCTCT TGTATGCTTT 100 

ATCAATAAGT GTGACATGAT GCAAGGGCAG GAGGAAATGA TTGAGCTTGT 150 

TGAAATGGAG GTACGTGAAC TTTTGGAGAA GTACAAGTTC CCTGCGGAGG 200 

agacgcStt TGTGCGGGGG TCTGCGGTGA AGGCATTGGA GGGTGATGCT 250 

15 GAAAATGAAG GAAAGATTTT GGAGCTTGTA AAAAAATGTG ATGAATGGAT 3 00 

SSSSSJoCol CCGCGTGCCA TTGAAAAACC GTTCCTTATG GCCATTGAGC 350 

ACGTTTTTGA GGTTGGAAAG GATAAGAAGG CCGTTGTTGT GAGCGGGCGT 400 

GTGGACCAGG GGCAGTTGAA GGTCGGCGCA GATGCAGAAC TTTCCGGGTT 450 

TAGTGCAAAG AAGCTGACGG TGAAGGTTGC TAGCATCGAA ATGTACCATA 500 

20 AAATTCTGGA GGATTGCATG CCTGGTGACT CTATTGGCGC GAAGATCGTT 550 

GGCAGCGGTG AAACAGTGAA CCTGTCGAAG GAAAATGTGG AACGCGGCAT 600 

GGTACTCTCC GCACCAGGTG CAACGACACT GTTCAACAGG GTCCGCGCGC 650 

AGGTGTACGT GTTGACAAAG GAAGAAGGCG GTCGTCACAC AGCCTTTAGT 7 00 

CCTCACTATC GTCCGCAGCT TTTCTTCCAC TGTGCTGATG TCACGGCGGA 7 50 

2 5 TATTAACTTC CCGGAAAGCG AGAAGCTTGC AGGGGAGCTG AACAAAAAGT 800 

ATGGC CGTGA CGCGGCGGAA CAGAAGAAGA AGGAGGCAGA ACTGAAAGAG 850 

TTTGAAAAGA CGCTTGTCTG CATGCCTGGT GATAACCGCG AACTCCTGCT 900 

CACCCTTGCC TATCCAATGC CAATGGAAAA GGGA 9J4 



30 



45 



2 ) INFORMATION FOR SEQ ID NO: 855 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 937 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : Doubl e 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: CGL-1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 855 



TGGCGGCATT CTTGTGGTGG CAGCTAACGA CGGATGCATG CCGCAGACGC 50 

GTGAGCACCT GCTTATTTGT TCGCAGATTG GCCTTCCTGC TCTTGTATGC 100 

50 TTTATCAATA AGTGTGACAT GATGCAAGGG CAGGAGGAAA TGATTGAACT 150 

TGTTGAAATG GAGGTACGTG AACTTTTGGA GAAGTACAAG TTCCCTGCGG 200 

AGGAGAC GCC ATTTGTGCGG GGGTCTGCGG TGAAGGCATT GGAGGGTGAT 250 

GCTGAAAATG AAGGAAAGAT TTTGGAGCTT GTAAAAAAAT GTGATGAATG 3 00 

GATTCCCGAC CCACCGCGTG CCATTGAAAA ACCGTTCCTT ATGGCCATTG 350 

55 AGCACGTTTT TGAGGTTGGA AAGGATAAGA AGGCCGTTGT TGTGAGCGGG 400 

CGTGTGGACC AGGGGCAGTT GAAGGTCGGC GCAGATGCAG AACTTTCCGG 450 

GTTTAGCGCA AAGAAGCTGA CGGTGAAGGT TGCTAGCATC GAAATGTACC 500 

ATAAAATTCT GGAGGATTGC ATGCCTGGTG ACTCTGTTGG CGCGAAGATC 550 

GTTGGCAGCG GTGAAACAGT GAACCTGTCG AAGGAAAATG TGGAACGCGG 600 

60 CATGGTACTC TCCGCACCAG GTGCAACGAC ACTGTTCAAC AAGGTCCGCG 650 

682 
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CGCAGGTGTA CGTGTTGACA 
AGTCCTCACT ATCGTCCGCA 
AGATATTAAC TTC C CGG AAA 
AGTATGGCCG TGATGCGGCG 
5 GAGTTTGAAA AGACGCTTGT 
GCTCACCCTT GCCTATCCAA 



AAGGAAGAAG GCGGTCGTCA 
GCTTTTCTTC CACTGTGCTG 
GCGAGAAGCT TGCAGGGGAG 
GAACAGAAGA AGAAGGAGGC 
CTGCATGCCT GGTGATAACC 
TGCCAATGGA AAAGGGA 



CACAGCCTTT 7 00 

ATGTCACGGC 7 50 

CTGAACAAAA 800 
AGAACTGAAA 850 
GCGAACTCCT 900 

937 



10 2 ) INFORMATION FOR SEQ ID NO : 856 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 888 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 <vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bigemina 

(B) STRAIN: Suarez-2 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 856 



CTTATTATGG AGCTGATCAA CAACGTCGCG AAGAAGCACG GTGGTTTCTC 50 

CGTGTTCGCC GGCGTCGGCG AGCGCACCAG GGAGGGCAAC GAGCTGTACC 100 

AC G AG ATG AT GGAGACCGGC GTCATCAAGC GCCGCCAGCT GGATGACGGC 150 

ACGTTCGACT TCTCCGGCTC CAAGGCCGCG CTGGTGTACG GC C AG ATG AA 200 

3 0 CGAGCCGCCA GGTGCCAGGG CGCGTGTTGC CCTCACTGGC CTGACGGTGG 2 50 

CCGAGTACTT CCGTGATGAG GACGGCCAGG ACGTGCTGCT CTTCATCGAC 3 00 

AACATCTACC GTTTCACCCA GGCTGGTTCT GAGGTGAGTG CCCTTTTGGG 3 50 

GCGCATCCCG TCCGCCGTCG GTTACCAGCC GACCCTCGCC ACCGACCTTG 4 00 

GCGCGCTGCA GGAGCGTATC ACGACGACCA AC AAGGGC TC CATCACCTCC 45 0 

35 GTGCAGGCCG TCTACGTGCC GGCCGACGAT ATCACCGACC CGGCGCCTGC 500 

GACCACCTTC ACCCATCTGG ACGCGACCAC TGTGCTCTCC CGTTCCATCG 550 

CCGAGCTGGG TATCTACCCC GCCGTCGACC CGCTCGACTC CACCTCGCGT 600 

ATGCTGTCCG CGAACATCGT CGGCGAGGAG CAGTACAACG TGGCGCGTGG 650 

CGTGCAGAAA ATACTGCAGG ACTACAAATC GCTGCAGGAT ATCATCGCCA 7 00 

40 TCCTGGGTAT GGACGAGCTG TCTGAGCAGG ACAAGTTCGT CGTCGCGCGT 7 50 

GCGCGCAAGG TTCAGCGTTT CCTATCCCAG CCCTTCCAGG TGGCTGAGGT 800 

ATTCACCGGC AAGCCCGGAC GTTTCGTCGA GCTGCAGGAC ACCATCAGCG 850 

GCGTCAAGGA GATTTTGGAC GGCGAGTGCG ACGACATG 8 88 

45 

2 ) INFORMATION FOR SEQ ID NO: 857 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 884 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bo vis 

(B) STRAIN: Suarez-3 

60 

683 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 857 

TGATTATGGA ATTGATCAAC AATGTCGCCA AGAAACACGG TGGGTTCTCC 50 

GTGTTCGCTG GTGTTGGTGA ACGTACGAGG GAAGGTAACG AACTGTACCA 100 

5 TGAAATGATG GAAACGGGTG TCATCAAGCG CCGTCAACTG GAAGACGGAA 150 

CATTTGACTT CTCGGGCTCT AAAGCTGCTT TGGTGTACGG ACAAATGAAC 200 

G AAC C AC C AG GTGCTAGAGC CCGTGTTGCA CTCACGGGAT TGACCGTTGC 250 

CGAGTATTTC CGTGATGAAG AGGGGCAGGA TGTGCTACTC TTCATCGATA 300 

ACATCTACCG TTTCACCCAG GCCGGTTCCG AAGTGAGTGC GCTGTTAGGA 350 

10 AGAATTCCAT CCGCCGTGGG TTATCAACCT ACATTGGCCA CTGATCTCGG 400 

AGCACTCCAG GAACGCATTA CTACAACCAA CAAGGGTTCG ATTACATCAG 450 

TCCAGGCAGT ATACGTCCCA GCCGATGATA TCACTGATCC CGCTCCAGCT 500 

ACCACTTTCT CGCACTTGGA TGCCACTACA GTGCTTTCTC GTTCAATTGC 550 

GGAGTTGGGT ATTTACCCTG CGGTCGACCC GCTTGACTCA ACGTCACGTA 600 

15 TGCTGTCGGC CAACATTGTA GGACAGGAAC AGTACGATGC CGCACGTGGT 650 

GTACAGAAAA TTTTACAGGA CTACAAATCA CTGCAGGATA TCATTGCCAT 7 00 

TCTGGGTATG GACGAGCTGT CTGAGCAGGA CAAGTTCGTT GTAGCACGCG 750 

CCCGTAAGGT ACAGCGTTTC CTGTCTCAGC CGTTCCAAGT GGCTGAGGTG 800 

TTCACCGGCA AGCCTGGGAG GTTCGTTGAA C T AC AGG AT A CCATCAGCGG 850 

20 TGTCAAGGAA ATCTGGAAGG TGAGTGTGAC GATA 884 



2) INFORMATION FOR SEQ ID NO : 858 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 871 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Babesia microttl 

(B) STRAIN: Persing-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 858 

40 TGGAACTGAT TAATAATGTG GCCAAAAAGC ATGGCGGTTA CTCTGTTTTT 50 

GCAGGTGTAG GTGAAAGGAC GAGGGAGGGT AATGAATTGT AC C ATGAAAT 100 

GATGGAGACA GGTGTTATAA AGAAAAAGGC ACTAGGTGGT GGGAAGTTTG 150 

ATTTCAGTGG ATCTAAAGCA GCGCTGGTCT ATGGACAAAT GAACGAGCCA 200 

CCTGGGGCCC GTGCTAGAGT GGCACTAACT GGATTAACAG TCGCAGAATA 250 

45 TTTCCGTGAC GAACAAGGAC AAGACGTGTT GTTGTTTATT GATAATATTT 300 

ACCGATTTAC TCAGGCAGGG TCTGAGGTTT CAGCCTTGCT AGGCCGTATA 350 

CCTTCAGCTG TGGGATACCA GCCTACATTG GCAACAGATC TTGGCTGTTT 400 

ACAAGAACGA ATT AC TACG A CCAAATCTGG TTCAATCACC AGTGTACAAG 450 

CTGTGTATGT GCCAGCAGAT GATATTACTG ATCCAGCGCC TGCCACAACT 500 

50 TTTACTCACT TGGACGCTAC TACTGTACTT AGCAGGCCAA TTGC TGAACT 550 

CGGTATTTAT CCAGCGGTAG ACCCGTTGGA TTCAACAAGC CGTATGCTAA 600 

GCGCGAACAT TGTGGGAAAT G AAC AC TATA GTGTAGCCCG TTCCGTGCAG 650 

AAGATACTGC AAGATTACAA ATCGCTTCAG GACATTATTG CCATTTTGGG 700 

TATGGATGAA CTGTCGGAAC AAGACAAAAA TATAGTAGCC CGAGCAAGGA 750 

55 AGATGCAAAG GTTCTTATCA CAGCCATTCC AAGTGGCGGA AGTTTTT AC T 800 

GGTAAAC CGG GAAGATTTGT GGAATTGGAA GATACAATTG CCGGGGCACG 850 

AGATATAATT GCGGGTAATT G 871 



60 



684 
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10 



2 ) INFORMATION FOR SEQ ID NO : 859 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1286 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania guyanensis 

(B) STRAIN: ATCC 5012 6 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 859 

TCTCGGAGGG CGTCCCGCCC GTGCTGACGG CGCTGGATGT GACGGAGGAC 50 

CTTGGCCGCG ATGAGCCGCT GACGCTGGAG ATCGTGCAGC ACCTGGACGC 100 

GAACACCGGC CGCTGCATTG CGATGCAGAC GACGGACCTG CTGAAGCTGA 150 

20 AGTCGAAGGT TGTGTCGACC GGCGGCAACA TCTCTGTGCC GGTGGGCCGT 2 00 

GAGACGCTGG GCCGCATCTT CAACGTGCTG GGCGACGCGA TCGACCAGCG 2 50 

CGGCCCCGTG GGTGAGAAGA TGCGCATGGC GATCCACGCC GAGGCCCCGA 300 

AGCTGGCGGA TCAGGCCGCG GAGGACACGA TCCTGACGAC CGGCATCAAG 3 50 

GTGATCGACC TGATTCTGCC CTACTGCAAG GGTGGCAAGA TCGGCCTGTT 400 

25 TGGCGGCGCC GGTGTGGGCA AGACCGTGAT CATCATGGAG CTGATTAACA 450 

ACGTCGCGAA GGGCCACGGT GGTTTCTCGG TGTTTGCCGG CGTTGGCGAG 500 

CGCACGCGCG AGGGC AC GG A CCTGTACCTG GAGATGATGC AGTCGAAGGT 550 

GATTGACCTG AAGGGCGAGT CGAAGTGCGT GCTTGTGTAC GGGCAGATGA 600 

ACGAGCCCCC GGGTGCGCGC GCGCGCGTTG CGCAGTCTGC GCTGAC GATG 650 

3 0 GCGGAGTACT TCCGAGACGT GG AGGGC C AG AATGTGCTGC TGTTCATCGA 700 

CAACATCTTC CGCTTCACGC AGGCGAACTC CGAGGTCTCT GCGCTGCTGG 750 

GCCGCATTCC GGCCGCCGTG GGCTACCAGC CGACGCTTGC GG AGGATCTT 800 

GGTATGCTGC AGGAGCGCAT CACGTCGACG ACGAAGGGGT CGATCACGTC 850 

CGTGCAGGCC GTGTACGTGC CTGCGGATGA TATC AC GG AT CCGGCGCCCG 900 

3 5 CGACGACGTT CTCGCACCTG GACGCGACGA CTGTGCTGGA CCGCGCGGTG 950 

GCGGAGTCGG GGATCTACCC TGCCGTGAAC CCGCTGGAGT GCGCGTCGCG 1000 

TATCATGGAC CCCGATGTGA TCGACGTGGA CCACTACAAC GTTGCGCAGG 1050 

ATATCGTGCA GATGCTGACC AAGTACAAGG AGCTGCAGGA TATCATTGCG 1100 

GTGCTTGGTA TCGACGAGCT GAGCGAGGAG GACAAGGTCG TGGTGGACCG 1150 

40 CGCGCGCAAG GTGACCCGGT TCCTGTCGCA GCCGTTCCAG GTTGCGGAGG 1200 

TGTTCACGGG CATGACGGGC CACTACGTGC AGCTGGCCGA C AC GGTGGAG 1250 

TCGTTC TCTG GGCTGCTGAT GGGGTCGTAC GACCAG 1286 



45 



55 



2 ) INFORMATION FOR SEQ ID NO: 860 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1222 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania mexicana 

(B) STRAIN: ATCC 50156 



60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 860 

685 
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CTCGGAGGGC GTCCCGCCCG TGCTGACGGC GCTGGATGTG ACGGAGGACC 50 

TTGGCCGCGA TGAGCCGCTG ACGCTGGAGA TCGTGCAGCA CCTGGACGCG 100 

AACACCGGCC GCTGCATTGC GATGCAGACG ACGGACCTGC TGAAGCTGAA 150 

5 GTCGAAGGTT GTGTCGACCG GCGGCAACAT CTCTGTGCCG GTGGGCCGTG 2 00 

AGACGCTGGG CCGCATCTTC AACGTGCTGG GCGACGCGAT CGACCAGCGC 250 

GGCCCCGTGG GTGAGAAGAT GCGCATGGCG ATCCACGCCG AGGCCCCGAA 3 00 

GCTGGCGGAT CAGGCCGCGG AGGACACGAT CCTGACGACC GGCATCAAGG 350 

TGATCGACCT GATTCTGCCC TACTGCAAGG GTGGCAAGAT CGGCCTGTTT 400 

10 GGCGGCGCCG GTGTGGGCAA GACCGTGATC ATCATGGAGC TGATTAACAA 450 

CGTCGCGAAG GGCCACGGTG GTTTCTCGGT GTTTGCCGGC GTTGGCGAGC 500 

GCACGCGCGA GGGCACGGAC CTGTACCTGG AGATGATGCA GTCGAAGGTG 550 

ATTGACCTGA AGGGCGAGTC GAAGTGCGTG CTTGTGTACG GGCAGATGAA 600 

CGAGCCCCCG GGTGCGCGCG CGCGCGTTGC GCAGTCTGCG CTGACGATGG 650 

15 CGGAGTACTT CCGAGACGTG G AGGGC C AG A ATGTGCTGCT GTTCATCGAC 700 

AACATCTTCC GCTTCACGCA GGCGAACTCC GAGGTCTCTG CGCTGCTGGG 750 

CCGCATTCCG GCCGCCGTGG GCTACCAGCC GACGCTTGCG GAGGATCTTG 800 

GTATGCTGCA GGAGCGCATC ACGTCGACGA CGAAGGGGTC GATCACGTCC 850 

GTGCAGGCCG TGTACGTGCC TGCGGATGAT ATCACGGATC CGGCGCCCGC 9 00 

20 GACGACGTTC TCGCACCTGG ACGCGACGAC TGTGC TGGAC CGCGCGGTGG 950 

CGGAGTCGGG GATCTACCCT GCCGTGAACC CGCTGGAGTG CGCGTCGCGT 1000 

ATCATGGACC CCGATGTGAT CGACGTGGAC CACTACAACG TTGCGCAGGA 1050 

TATCGTGCAG ATGCTGACCA AGTACAAGGA GCTGCAGGAT ATCATTGCGG 1100 

TGCTTGGTAT CGACGAGCTG AGCGAGGAGG ACAAGGTCGT GGTGGACCGC 1150 

25 GCGCGCAAGG TGACCCGGTT CCTGTCGCAG CCGTTCCAGG TTGCGGAGGT 1200 

GTTCACGGGC ATGACGGGCC AC 1222 



30 2 ) INFORMATION FOR SEQ ID NO: 861 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1246 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 50129 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 861 



CCGTGCTGAC GGCGCTGGAT GTGACGGAGG ACCTTGGCCG CGATGAGCCG 50 

CTGACGCTGG AGATCGTGCA GCACTTGGAC GCGAACACCG GCCGCTGCAT 100 

TGCGATGCAG ACGACGGACC TGCTGAAGCT GAAGTCGAAG GTTGTGTCGA 150 

CCGGCGGCAA CATCTCTGTG CCGGTGGGCC GTGAGACGCT GGGCCGCATC 2 00 

50 TTCAACGTTC TGGGCGACGC GATCGACCAG CGCGGCCCCG TGGGCGAGAA 250 

GATGCGCATG GCGATCCACG CCGAGGCCCC GAAGCTGGCG GATCAGGCCG 3 00 

CGGAGGACAC GATCCTGACG ACCGGCATCA AGGTGATCGA CCTGATTCTG 350 

CCCTACTGCA AGGGTGGCAA GATCGGCCTG TTCGGCGGTG CCGGTGTGGG 400 

CAAGACTGTG ATCATCATGG AGCTGATCAA CAACGTCGCG AAGGGCCACG 450 

55 GTGGTTTCTC CGTGTTTGCC GGCGTTGGCG AGCGCACGCG CGAGGGCACG 500 

GACCTGTACC TGGAGATGAT GCAGTCGAAG GTG ATTGAC C TGAAGGGCGA 550 

GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT GAACGAGCCC CCGGGTGCGC 600 

GCGCGCGCGT TGCGCAGTCT GCGCTGACGA TGGCGGAGTA CTTCCGCGAC 650 

GTGG AGGGC C AGAACGTGCT GCTGTTCATC GACAACATCT TCCGCTTCAC 7 00 

60 GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT GGGCCGCATT CCGGCCGCCG 750 

686 
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TGGGCTACCA GCCGACGCTT GCGGAGGATC 
ATCACGTCGA CAACGAAGGG GTCGATCACG 
GCCAGCGGAT GATATCACGG ATCCCGCGCC 
TGGACGCGAC GACTGTGCTG GACCGCGCGG 
5 CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG 
GATCGATGTG GACCACTACA ACGTTGCGCA 
CCAAGTACAA GGAGCTGCAG GATATCATTG 
CTGAGCGAGG AAGACAAGGT TGTTGTGGAC 
GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 
10 GCCACTACGT GCAGCTGGTC GACACGGTGG 



TTGGTATGCT GCAGGAGCGC 800 

TCCGTGCAGG CCGTGTACGT 850 

CGCGACGACG TTCTCGCACC 900 

TGGCGGAGTC GGGCATCTAC 9 50 

CGTATCATGG ACCCCGATGT 1000 

GGATATCGTG CAGATGCTGA 1050 

CGGTGCTTGG CATCGACGAG 1100 

CGCGCGCGCA AGGTGACCCG 1150 

GGTGTTCACG GGCATGACGG 12 00 

AGTCGTTCTC TGGCCT 12 46 



15 



2 ) INFORMATION FOR SEQ ID NO : 862 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1265 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
2 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 3 0815 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 862 

30-. GGCGTGCCGC CCGTGCTGAC GGCGCTGGAT GTGACGGAGG ACCTTGGCCG 50 

CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAC GCGAACACGG 100 

GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT GAAGTCGAAG 150 

GTCGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC GTGAGACGCT 2 00 

GGGCCGCATC TTCAAYGTTC TGGGCGACGC GATCGACCAG CGCGGCCCCG 2 50 

35 TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC GAAGCTGGCG 3 00 

GATCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA AGGTGATCGA 3 50 

CCTGATTCTG CCCTACTGCA AGGGTGGCAA GATCGGCCTG TTCGGCGGTG 4 00 

CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA CAACGTCGCG 4 50 

AAGGGCCACG GCGGTTTCTC CGTGTTTGC C GGCGTTGGCG AGCGCACGCG 5 00 

40 CGAGGGCACG GACCTGTACC TGGAGATGAT GCAGTCGAAG GTGATTGACC 550 

TGAAGGGCGA GTCGAAGTGY GTGCTTGTGT ATGGGCAGAT GAACGAGCCC 6 00 

CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA TGGCGGAGTA 650 

CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC GACAACATCT 7 00 

TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT GGGCCGCATT 750 

45 CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC TTGGTATGCT 800 

GCAGGAGCGC ATCACGTCGA CAACGAAGGG GTCGATCACG TCCGTGCAGG 850 

CCGTGTACGT GCCAGCGGAT GATATCACGG ATCCCGCGCC CGCGACGACG 900 

TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGCGCGG TGGCGGAGTC 950 

GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG CGTATCATGG 1000 

50 ACCCTGATGT GATCGATGTG GACCACTACA ACGTTGCGCA GGATATCGTG 1050 

CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG CGGTGCTTGG 1100 

CATCGACGAG CTGAGCGAGG AAGACAAGGT TGTTGTGGAC CGCGCGCGCA 1150 

AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA GGTGTTCACG 1200 

GGCATGACGG GCCACTACGT GCAGCTGGTC GACACGGTGG AGTCGTTCTC 1250 

55 TGGCCTGCTG ATGGG 1265 



2 ) INFORMATION FOR SEQ ID NO: 863 

60 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1191 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Bordetella pertussis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 863 

ATGGCAAAAG GCAAGTTTGA ACGTACCAAG CCGCACGTGA AC GTGGGT AC 50 

15 GATTGGTCAC GTTG AC C AC G GCAAAACGAC GTTGACGGCG GCGATCACGA 100 

CGGTGCTGTC GAACAAGTTC GGCGGCGAGG CTCGCGGCTA CGACCAGATT 150 

GACGCGGCGC CGGAAGAGAA GGCGCGTGGG ATCACGATCA ACACCTCGCA 2 00 

CGTTGAGTAC GAGACGGAGA CGCGTCACTA CGCGCACGTT GATTGCCCGG 2 50 

GTCACGCTGA CTACGTGAAG AACATGATCA CGGGTGCTGC GCAGATGGAC 3 00 

2 0 GGCGCGATCC TGGTGGTGTC GGCCGCAGAC GGCCCGATGC CGCAGACGCG 3 50 

CGAGCACATT TTGCTGTCGC GCCAGGTTGG CGTGCCGTAC ATCATCGTGT 400 

TCCTGAACAA GGCGGACATG GTTGATGACG CGGAGCTGCT CGAGCTGGTG 450 

GAGATGGAAG TCCGCGAACT GCTGAGCAAG TACGATTTCC CGGGCGATGA 500 

CACGCCGATC GTGAATGGTG CGGCCAAGCT GGCGCTGGAA AGCGACAACG 55 0 

25 GCGACCTGGG CGAGCAGGCG ATTCTGTCGC TGGCGCAAGC GCTGGACACG 600 

TACATTCCGA CGCCGGAGCG CGCGGTCGAC GGTGCGTTCC TGATGCCGGT 650 

GGAAGACGTG TTCTCGATCT CGGGCCGTGG CACGGTGGTG AC TGGC CGT A 7 00 

TCGAGCGCGG CGTGGTGAAG GTTGGCGAGG AAATCGAAAT CGTGGGCATC 7 50 

AAGCCGACGG TGAAGACGAC CTGCACGGGC GTGGAGATGT TCCGCAAGCT 800 

3 0 GCTGGACCAG GGCCAGGCGG GCGACAACGT GGGTATCTTG CTGCGCGGCA 850 

CCAAGCGTGA AGACGTCGAG CGTGGCCAGG TGCTGGCCAA GCCGGGTTCG 900 

ATCAACCCGC ACACGGACTT CACGGCCGAG GTGTACATTC TGTCCAAGGA 950 

AGAGGGTGGC CGTCACACGC CGTTCTTCAA CGGCTATCGT CCGCAGTTCT 1000 

ACTTCCGCAC GACGGACGTG ACCGGCACGA TCGACCTGCC GGCGGACAAG 1050 

35 GAAATGGTGC TGC CGGGCG A CAACGTGTCG ATGACCGTCA AGCTGCTGGC 1100 

CCCGATCGCC ATGGAAGAAG GTCTGCGTTT CGCCATCCGT GAAGGC GGTC 1150 

GTACCGTCGG TGCCGGCGTC GTCGCCAAGA TCATCAAGTA A 1191 



40 



50 



55 



2 ) INFORMATION FOR SEQ ID NO: 864 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1350 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma, brucei 

(B) STRAIN: LVH/75 /USAMRU-K/ 18 , 

(C) ACCESSION NUMBER: extracted from U10562 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 864 



ATGGGAAAGG AAAAGGTGCA CATGAATCTT GTGGTGGTGG GCCACGTCGA 50 

TGCCGGTAAA TCCACTGCAA CGGGTCACTT GATCTACAAG TGCGGTGGTA 100 

60 TTGACAAACG TACGATCGAG AAGTTCGAGA AAGAAGCTGC CGACATTGGT 150 
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AAGGCCTCAT TCAAGTACGC ATGGGTGCTG GACAAGCTGA AGGCTGAGCG 200 

CGAACGTGGT ATCACGATCG ACATTGCACT GTGGAAATTC GAGTCACCCA 250 

AGTCTGTCTT CACTATTATT GATGCTCCTG GGCACCGTGA CTTCATCAAG 3 00 

AACATGATCA CCGGCACATC GCAAGCCGAC GCAGCCATCC TCATCATTGC 350 

5 CTCTGCGCAG GGTGAGTTCG AGGC TGGTAT CTCCAAGGAT GGACAGACCC 400 

GCGAGCACGC GTTGCTGGCC TTCACTTTGG GTGTGAAGCA GATGGTTGTG 450 

TGCTGCAACA AAATGGACGA CAAGACTGTG AACTACGGAC AGGAGCGGTA 500 

TGACGAGATT GTGAAGGAGG TGTCTGCTTA CATCAAGAAG GTTGGGTACA 550 

ACGTGGAGAA GGTGCGCTTC GTCCCCATCT CCGGATGGCA GGGCGACAAC 600 

10 ATGATTGAGA AATCCGAGAA GATGCCATGG TACAAGGGTC CAACGCTCCT 650 

GG AGGC AC TA GACATGCTGG AGCCACCAGT GCGTCCGAGC GACAAGCCCC 7 00 

TGCGTCTGCC ACTGCAGACG TGTACAAAGA TCGGTGGTAT TGGCACCGTG 750 

CCCGTTGGTC GTGTGGAGAC CGGCGTGATG AAGCCTGGTG ATGTGGTGAC 8 00 

GTTTGCCCCC GCCAACGTGA CGACCGAGGT GAAATCGATC GAGATGCACC 850 

15 ACGAGCAGCT CGCTGAGGCG ACCCCCGGTG ACAACGTCGG CTTTAACGTG 900 

AAGAACGTTT CTGTAAAGGA CATCCGCCGT GGCAACGTCT GCGGTAACAC 950 

CAAGAACGAC CCCCCAAAGG AGGCCGCCGA CTTCACGGCA CAGGTGATCA 1000 

TCCTGAACCA CCCCGGACAG ATTGGAAACG GTTATGCGCC CGTGCTGGAC 1050 

TGCCACACAT CGCACATTGC CTGCAAGTTC GCGGAGATCG AGTCGAAGAT 1100 

2 0 CGACCGTCGC TCTGGCAAGG AGCTGGAGAA GGCTCCCAAG TCGATCAAGT 1150 

CTGGCGACGC CGCGATCGTG CGCATGGTGC CGCAGAAGCC TATGTGCGTG 12 00 

GAGGTCTTCA ACGACTACGC GCCACTCGGC CGCTTTGCCG TGCGTGACAT 1250 

GCGCCAGACC GTCGCTGTCG GTATCATCAA GGCCGTGACC AAGAAGGACG 13 00 

GTTCTGGTGG TAAGGTG AC G AAGGCTGCGG TGAAGGCTTC GAAGAAATAA 13 50 

25 



2) INFORMATION FOR SEQ ID NO: 865 
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(i) 



SEQUENCE CHARACTERISTICS: 

(A) —LENGTH:- 105-2— bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 865 



AAGCTCAAGG 
GCAATTCGAA 

45 ACAGAGATTT 
GCTTTATTGG 
AGGTCAAACC 
AAATGATTGT 
CGTTTTGATG 

50 TTACAACACC 
ATAATATGGT 
TTAGTCGAAG 
GCCACTCCGT 
CTGTCCCAGT 

55 GTTACCTTCG 
GCACCATGAG 
ATGTTAAGAA 
GATGCCAAGA 
TATCGTCCTC 

60 TTGACTGTCA 



C TG AG AG AG A 
ACCCCAAAAT 
CATCAAGAAT 
TTGTCCCAGC 
AGAGAACATG 
CGGTATTAAC 
AAATCTTCAA 
GAGAAGATCC 
TGAGAGATCT 
CCCTCGACAC 
CTCCCATTAC 
CGGTCGTGTT 
CTCCAGCTGG 
CAGATGCCAG 
CGTCTCCATC 
ATGACCCAGC 
AACCACCCAG 
CACCGCTCAC 



AAGAGGTATT 
ACCACTACAC 
ATGATTACTG 
CGATCGTTTC 
CTTTATTGGC 
AAGATGGATA 
CGAAGTTGAT 
CATTCGTTGC 
GACAAGATGC 
TATGGAACCA 
AAGATGTTTA 
GAG AC TGGTA 
TGTTACCACT 
AGGCCGTCCC 
AAGGATATCA 
TAAGGGCTGT 
GTGAAATCAA 
ATTTCCTGCA 



ACCATCGATA 
TGTCATTGAT 
GTACCTCTCA 
GAAGGTGCCT 
CTTCACTTTG 
CCTGTGAATA 
GGTTACCTCA 
CATTTCTGGT 
CATGGTATAA 
C C AAAG AG AC 
CAAGATAGGT 
TCATCAGACC 
GAAGTTAAGT 
AGGTGACAAC 
AGAGAGGTTT 
GAAGAC TTC A 
GAACGGTTAC 
AATTCCAGAC 



TTGCTTTATG 5 0 

GCCCCAGGTC 100 

AGC TGATGTT 150 

TCTCCAAGGA 200 

GGTGTCAGAC 250 

CAAGCAATCT 3 00 

AGAAGGTTGG 3 5 0 

TTCGTTGGTG 400 

GGGTAAGACC 450 

CAACTGACAA 5 00 

GGTGTAGGTA 55 0 

AGGTATGAAT 600 

CAGTAGAAAT 650 

GTTGGTTTCA 7 00 

CGTTGCTTCT 7 50 

CTGCTCAAGT 800 

TCTCCAGTCG 850 

TATCACTGCT 9 00 
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AAGATGGACA AGAGATCTGG 
CAAGTCTGGT GATGCTGCTT 
GTGTTGAGGC CTTCACTGAC 
GA 



TAAGGTTTTG GAAGAAAACC 
TGGTTGTTAT GCAACCTTTG 
TACCCACCTC TAGGTCGTTT 



CAAAGCTTAT 95 0 

AAGCCACTTT 1000 
CGCTGTCCGT 1050 
1052 



2) INFORMATION FOR SEQ ID NO: 866 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 837 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

15 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophytics 
20 (B) STRAIN: ATCC 35552 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 866 

CAATGAAGTT CCAGAAATTA ACAATGCCTT AGTCGTAGAC GTTGAAAGAG 50 

25 ATGAAGGTAC AGTATCTCTT ACATTAGAAG TGGCATTACA ACTTGGCGAT 100 

GATGTCGTAC GTACAATTGC AATGGATTCT ACTGATGGTG TTAAACGTGG 150 

TACAGAAGTT CGAGATAGCG GAGATAGCAT C AGTGTTC C A GTTGGTGATG 2 00 

CTACGTTAGG ACGTGTGTTT AATGTTC TTG GTGATACAAT TG AC TT AG AC 250 

GAGAAGCTTG AT AC TTC TGT CAAACGTGAT CCAATTCATA GAGAAGCACC 3 00 

™ TGC ATTCG AT CAATTATCAA CAAAAGTTGA AATCTTAGAA ACAGGTATTA 350 

AAGTAATTGA TTTACTTGCA CCATATATTA AAGGTGGTAA AATCGGTTTA 400 

TTCGGTGGCG CTGGTGTAGG TAAAACAGTA TTAATTCAAG AATTAATTAA 450 

TAATATAGCT CAAGAACATG GTGGTATTTC AGTATTTGCC GGCGTAGGTG 500 

AACGTACGCG TGAAGGTAAT GACTTATACT ACGAAATGAG TGATAGTGGT 5 50 

35 GTTATTAAGA AAACAGCTAT GGTCTTCGGA CAAATGAATG AGCCACCTGG 600 

TGCGCGTATG CGTGTTGCTT TATCAGGCTT AACAATGGCT GAACACTTCC 650 

GTGATGTACA AGGACAAGAT GTTTT AC TAT TTATTGATAA CATATTCAGA 7 00 

TTTACGCAAG CTGGTTCAGA AGTATCAGCA CTATTAGGTC GTATGCCATC 7 50 

AGCCGTTGGT TATCAACCTA CCCTTGCTAC TGAAATGGGT CAATTACAAG 800 

40 AACGTATTAC ATCAACAACT AAAGGATCTG TAACGTC 837 



45 



2 ) INFORMATION FOR SEQ ID NO: 867 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Zoogloea ramigera 

(B) STRAIN: ATCC 25935 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 867 



60 



AAGGTATTCG ATGCCTTGAA AATGGAAGGC TCCGAGCTGA CCCTGGAAGT 

690 
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ACAACAGCAG 
CCGACGGCCT 
ATGGTGCCAG 
TAACCCGATC 
5 TCCACCGCGC 
CTGGAAACCG 
CGGTAAAGTC 
TGATGGAACT 
TTTGCCGGTG 

10 GATGGCTGAC 
TTGCGATGGT 
GTGGCGCTGA 
AGACGTTCTG 
CCGAAGTATC 

15 CCTACGCTGG 
CAAGACCGGT 



CTGGGCGACG 
GCGTCGCGGC 
TCGGTAAAGC 
GACGAATGCG 
TCCTCCTGCG 
GCATTAAAGT 
GGTCTGTTCG 
GATCAACAAC 
TGGGTGAGCG 
GCCAAAGTGG 
CTACGGTCAG 
CCGGTCTGAC 
TTCTTCGTGG 
GGCACTGCTG 
CCGAAGAAAT 
TCGATCAC 



GCATTGTCCG 
ATGATGATCC 
AACCCTGGGT 
GCGCGGTCGC 
TACGACGAAC 
TATTGACCTG 
GCGGTGCAGG 
ATCGCCAAAG 
TACCCGTGAA 
TCGATCTGGA 
ATGAATGAAC 
CATGGCTGAA 
ACAACATCTA 
GGCCGTATGC 
GGGTCGCCTG 



TACCATTGCA 
AGAACACCGG 
CGCATCATGG 
TCACGACCAG 
TGTCGCCATC 
GTGTGCCCGT 
TGTGGGCAAG 
CACACTCGGG 
GGTAACGACT 
AAATCCAGAG 
CACCAGGCAA 
GCATTCCGTG 
CCGCTTCACC 
CATCGGCTGT 
CAAGAGCGCA 



CTGGGTACCT 100 

CAAACCTATC 150 

ACGTGCTGGG 2 00 

ATCGCTTCGA 250 

GCAAGATCTG 300 

TCGCCAAGGG 350 

ACCGTGAACA 400 

TCTGTCCGTG 450 

TCTACCACGA 500 

AACTCCAAGG 550 

CCGTCTGCGC 600 

ACGAAGGCAA 650 

CTGGCCGGTA 7 00 

GGGTTACCAG 7 50 

TCACTTCGAC 800 
818 



20 2 ) INFORMATION FOR SEQ ID NO : 868 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



30 



(vi> ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: ATCC 43867 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 868 



35 

CTATCTTAGT 
CACATTCTTT 
AAACAAAGTT 
TGGAAGTTCG 

40 CCTGTAATCT 
TGAGCAAAAA 
C AC C AGAACG 
TTCTCAATCA 
TC AAATC AAA 

45 CAAGCAAAAC 
TACGCTGAAG 
TGATGACGTA 
CACATACAAA 
GGTCGTCATA 

50 TACTAC TG AC 
TTATGCCTGG 



AGTATCTGCT 
TATCACGTAA 
GACATGGTTG 
TGACTTATTA 
CTGGTTCTGC 
ATC TT AG ACT 
TGATTC TG AC 
CTGGTCGTGG 
GTCGGTGAAG 
AACTGTTACT 
CTGGTGACAA 
CAACGTGGTC 
ATTC AAAGC G 
C AC C ATTCTT 
GTAACTGGTG 
CGATAACGTT 



GCTGATGGCC 
CGTTGGTGTT 
ACGATGAAGA 
AGCGAATATG 
ATTAAAAGCT 
TAATGCAAGC 
AAACCATTCA 
TACTGTTGCT 
AAATCGAAAT 
GGTGTAGAAA 
CATTGGTGCA 
AAGTTTT AG C 
GATGTTTACG 
C ACT AAC TAC 
TTGTTAACTT 
GAAATGGA 



CAATGCCACA 
CCAGCATTAG 
ATTATTAGAA 
ACTTCCCAGG 
TTAGAAGGCG 
TGTTGATGAC 
TGATGCCAGT 
ACAGGCCGTG 
CATCGGTATG 
TGTTCCGTAA 
TTATTACGTG 
TGCTCCTGGT 
TTTTATCTAA 
CGCCCACAAT 
ACCAGAAGGT 



AACTCGTGAA 5 0 

TTGTATTCTT 100 

TTAGTAGAAA 150 

TGACGATGTA 2 0 0 

ACGCTGACTA 250 

TTCATTCCAA 3 00 

TGAGGACGTA 3 50 

TTGAACGTGG 40 0 

CAAGAAGAAT 450 

ATTATTAGAC 5 00 

GTGTTTCACG 550 

ACTATTACAC 600 

AGATGAAGGT 650 

TCTATTTCCG 7 00 

ACTGAAATGG 7 50 
778 



55 2 ) INFORMATION FOR SEQ ID NO: 869 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 640 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseli flaws 

(B) STRAIN: R689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 869 

TGGTCCTATG CCTCAAACAC GTGAACACAT CTTGTTATCA CGTAACGTTG 50 

GTGTACCATA CATCGTTGTT TTCTTAAACA AAATGGATAT GGTTGATGAC 100 

gIaGAATTAC TAGAATTAGT TGAAATGGAA GTTCGTGACT TATTGTCAGA 150 

SSSSiSEc CCAGGCGACG ATGTTCCTGT AATCGCTGGT TCTGCTTTGA 200 

AAGCTCTTGA AGGCGATGCT TCATACGAAG AAAAAATCAT GGAATTAATG 250 

GCTGCAGTTG ACGAATACGT TCCAACTCCA GAACGTGACA CTGACAAACC 3 00 

ATTCATGATG CCAGTCGAAG ACGTATTCTC AATC AC TGGA CGTGGTACTG 350 

TTGCTACAGG CCGTGTTGAA CGTGGACAAG TTCGCGTTGG TGACGAAGTT 400 

GAAATCGTTG GTATTGCTGA AGAAACTGCT AAAACAACTG TAACTGGTGT 4 50 

TGAAATGTTC CGTAAATTGT TAGACTATGC TGAAGCAGGG GATAACATTG 500 

GTGCATTGCT ACGTGGTGTT GCTCGTGAAG ACATCCAACG TGGACAAGTA 5 50 

TTGGCTAAAG CTGGTACAAT CACACCTCAT ACAAAATTTA AAGCTGAAGT 600 

TTACGTTTTA ACAAAAGAAG AAGGTGG AC G TCACACTCCA &4U 



2 ) INFORMATION FOR SEQ ID NO: 870 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 644 bases 
(B) — TYPE"r~Nuclfe"tcr-aci"d 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: R754 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 870 



ID NO: 870 






TGTTATCACG 


TAACGTTGGT 


50 


ATGGATATGG 


TTG ATGAC G A 


100 


TCGTGACTTA 


TTGTCAGAAT 


150 


TCGCTGGTTC 


TGCTTTGAAA 


200 


AAAATCATGG 


AATTAATGGC 


250 


ACGTGACACT 


GACAAACCAT 


300 


TCACTGGACG 


TGGTACTGTT 


350 


CGCGTTGGTG 


ACGAAGTTGA 


400 


AACAACTGTA 


AC TGGTGTTG 


450 


AAGCAGGGGA 


TAACATTGGT 


500 


ATCCAACGTG 


GACAAGTATT 


550 


AAAATTTAAA 


GCTGAAGTTT 


600 


ACACACCATT 


CTTC 


644 



2 ) INFORMATION FOR SEQ ID NO: 871 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 637 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Enterococcus flavescens 

(B) STRAIN: R758 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 871 

15 TCCTATGCCT CAAACACGTG AACACATCTT GTTATCACGT AACGTTGGTG 50 

TACCATACAT CGTTGTTTTC TTAAACAAAA TGGATATGGT TGATGACGAA 100 

G AATTAC TAG AATTAGTTGA AATGGAAGTT CGTGACTTAT TGTCAGAATA 150 

TGACTTCCCA GGCGACGATG TTCCTGTAAT CGCTGGTTCT GCTTTGAAAG 200 

CTCTTGAAGG CGATGCTTCA TACGAAGAAA AAATCATGGA ATT AATGGC T 250 

2 0 GCAGTTGACG AATACGTTCC AACTCCAGAA CGTGACACTG ACAAACCATT 300 

C ATGATGC C A GTCGAAGACG TATTCTCAAT CACTGGACGT GGTACTGTTG 350 

CTACAGGCCG TGTTGAACGT GGACAAGTTC GCGTTGGTGA CGAAGTTGAA 400 

ATCGTTGGTA TTGCTGAAGA AACTGCTAAA ACAACTGTAA CTGGTGTTGA 45 0 

AATGTTCCGT AAATTGTTAG AC TATGC TG A AGCAGGGGAT AACATTGGTG 500 

25 CATTGCTACG TGGGGTTGCT CGTGAAGACA TCCAACGTGG ACAAGTATTA 550 

GCTAAAGCTG GTACAATCAC ACCTCATACA AAATTTAAAG CTGAAGTTTA 600 

CGTTTTAACA AAAGAAGAAG GTGGACGTCA CACTCCA 637 



30 
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2 ) INFORMATION FOR SEQ ID NO: 872 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 643 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R631 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 872 

GTCCTATGCC TCAAACTCGT GAACACATCT TGTTATCACG TAACGTTGGC 50 

GTACCATACA TCGTTGTTTT CTTGAACAAA ATGGATATGG TTGATGACGA 100 

AGAATTGCTA GAATTAGTTG AAATGGAAGT TCGTGACCTA TTGTCTGAAT 15 0 

50 ATGACTTCCC AGGCGACGAT GTTCCTGTAA TCGCCGGTTC TGCTTTGAAA 200 

GCTCTTGAAG GAGATCCTTC ATACGAAGAA AAAATCATGG AATTGATGGC 250 

TGCAGTTGAC GAATACGTTC CAACTCCAGA AC GTG AT ACT GACAAACCAT 3 00 

TCATGATGCC AGTCGAAGAC GTATTCTCAA TCACTGGACG TGGTACTGTT 3 50 

GCTACAGGCC GTGTTGAACG TGGACAAGTT CGCGTTGGTG ATG AAG TAG A 400 

55 AATC GTTGGT ATTGCTGACG AAACTGCTAA AACAACTGTA ACAGGTGTTG 450 

AAATGTTCCG TAAATTGTTA GACTATGCTG AAGCAGGGGA TAACATTGGT 5 00 

GCATTGCTAC GTGGGGTTGC TCGTGAAGAC ATCCAACGTG GACAAGTATT 55 0 

GGCTAAAGCT GGTACAATCA CACCTCATAC AAAATTCAAA GCTGAAGTTT 600 

ATGTTTTGAC AAAAGAAGAA GGTGGACGTC ACACTCCATT CTT 643 

60 
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2) INFORMATION FOR SEQ ID NO: 873 



10 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 641 bases 

(B) TYPE: Nucleic acid 

(C ) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R691 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 873 



GTCCTATGCC 
20 GTAC CAT AC A 
AGAATTG CTA 
ATGACTTCCC 
GCTCTTGAAG 
TGCAGTTGAC 
25 TCATGATGCC 
GCTACAGGCC 
AATCGTTGGT 
AAATGTTCCG 
GCATTGCTAC 
3 0 GGCTAAAGCT 
_ ATGTTTTG AC 



TCAAACTCGT 
TCGTTGTTTT 
GAATTAGTTG 
AGGCGACGAT 
GAGATCCTTC 
GAATACGTTC 
AGTCGAAGAC 
GTGTTGAACG 
ATTGCTGACG 
TAAATTGTTA 
GTGGGGTTGC 
GGTACAATCA 
-AAAAGAAGAA 



GAACACATCT 
CTTGAACAAA 
AAATGGAAGT 
GTTCCTGTAA 
ATACGAAGAA 
CAACTCCAGA 
GTATTCTCAA 
TGGACAAGTT 
AAACTGCTAA 
GACTATGCTG 
TCGTGAAGAC 
CACCTCATAC 
GGTGGACGTC 



TGTTATCACG 
ATGGATATGG 
TCGTGACCTA 
TCGCCGGTTC 
AAAATCATGG 
ACGTGATACT 
TCACTGGACG 
CGCGTTGGTG 
AACAACTGTA 
AAGCAGGGGA 
ATCCAACGTG 
AAAATTCAAA 
-ACACTCCATT 



TAACGTTGGC 50 

TTGATGACGA 100 

TTGTCTGAAT 150 

TGC TTTG AAA 2 00 

AATTGATGGC 2 50 

G AC AAAC C AT 3 00 

TGGTACTGTT 3 50 

ATGAAGTAGA 400 

ACAGGTGTTG 450 

TAACATTGGT 500 

GACAAGTATT 550 

GCTGAAGTTT 600 

C 641 



35 2 ) INFORMATION FOR SEQ ID NO: 874 



40 



45 



50 



55 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 681 bases 

(B) TYPE : Nucleic acid 
<C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: LSPQ 2514 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87 4 



ACCAGCATTA GTAGTATTCT TAAATAAAGT TGACATGGTT 
AATTATTAGA ATTAGTTGAA ATGGAAGTAC GTGACTTATT 
GACTTCCCAG GTGACGATGT ACCTGTAATC GCTGGTTCAG 
TTTAGAAGGC GATGCTCAAT ACGAAGAAAA AATCTTAGAA 
CAGTTGATGA CTACATTCCA ACTCCAGAAC GTGATTC TG A 
ATGATGCCAG TTGAGGACGT ATTCTCAATC ACTGGTCGTG 
TACAGGCCGT GTTGAACGTG GGCAAATCAA AGTTGGTGAA 
TCATTGGTAT CCATGACACT TCTAAAACAA CTGTTACTGG 
TTCCGTAAAT TATTAGACTA CGCTGAAGCT GGTGACAACA 
ATTACGTGGT GTTGCTCGTG AAGACGTACA ACGTGGTCAA 

694 



GACGATGAAG 50 

ATCTGAATAC 100 

CATTAAAAGC 150 

TTAATGCAAG 2 00 

CAAACCATTC 2 50 

GTACTGTTGC 3 0 0 

GAAGTTGAAA 3 50 

TGTAGAAATG 400 

TCGGTGCATT 4 50 

GTATTAGCTG 500 
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CTCCAGGTTC AATCACACCT 
TTATCTAAAG ACGAAGGTGG 
TCCACAATTC TATTTCCGTA 
CAGAAGGTAC TGAAATGGTT 



CACACAAAAT TTAAAGCAGA 

ACGTCACACT CCATTCTTCA 

CTAC TGACGT AACTGGTGTT 

ATGCCTGGCG A 



CGTATACGTT 550 
CAAACTATCG 600 
GTTAACTTAC 650 
681 



2 ) INFORMATION FOR SEQ ID NO: 875 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 675 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidezmldis 

2 0 (B) STRAIN: R591 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 875 

ATTATCACGT AAC GTTGGTG TACCAGCATT AGTTGTATTC TTAAACAAAG 50 

25 TTGACATGGT AGACGACGAA GAATTATTAG AATTAGTTGA AATGGAAGTT 100 

CGTGACTTAT TAAGCGAATA TGACTTCCCA GGTGACGATG TACCTGTAAT 150 

CGCTGGTTCT GCATTAAAAG CATTAGAAGG CGATGCTGAA TACGAACAAA 200 

AAATC TT AG A CTTAATGCAA GCAGTTGATG ATTACATTCC AACTCCAGAA 250 

CGTGATTCTG ACAAACCATT CATGATGCCA GTTGAGGACG TATTCTCAAT 3 00 

3 0 CACTGGTCGT GGT AC TGTTG CTACAGGCCG TGTTGAACGT GGTCAAATCA 350 

AAGTTGGTGA AGAAGTTGAA ATCATCGGTA TGCACGAAAC TTCTAAAACA 400 

ACTGTTACTG GTGTAGAAAT GTTCCGTAAA TTATTAGACT ACGCTGAAGC 450 

TGGTGACAAC ATCGGTGCTT TATTACGTGG TGTTGCACGT GAAGACGTAC 500 

AACGTGGTCA AGTATTAGCT GCTCCTGGTT CTATTACACC ACACACAAAA 550 

3 5 TTCAAAGCTG AAGTATACGT ATTATCTAAA GATGAAGGTG GACGTCACAC 600 

TCCATTCTTC ACTAACTATC GCCCACAATT CTATTTC CGT ACTACTGACG 650 

T AAC TGGTGT TGTAAACTTA CCAGA 675 



40 



50 



2 ) INFORMATION FOR SEQ ID NO: 87 6 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 704 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: CSG 10 ' 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 876 

TCTTATTATC ACGTAACGTT GGTGTACCAG CATTAGTTGT ATTCTTAAAC 50 

AAAGTTGACA TGGTAGACGA CGAAGAATTA TTAGAATTAG TTGAAATGGA 100 

AGTTCGTGAC TTATTAAGCG AATATGACTT CCCAGGTGAC GATGTACCTG 150 

60 TAATCGCTGG TTCTGCATTA AAAGCATTAG AAGGCGATGC TGAATACGAA 2 00 

695 
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CAAAAAATCT TAGACTTAAT GCAAGCAGTT 
AGAACGTGAT TCTGACAAAC CATTCATGAT 
CAATCACTGG TCGTGGTACT GTTGCTACAG 
ATCAAAGTTG GTGAAGAAGT TGAAATCATC 
5 AACAACTGTT ACTGGTGTAG AAATGTTCCG 
AAGCTGGTGA CAACATCGGT GCTTTATTAC 
GTACAACGTG GTCAAGTATT AGCTGCTCCT 
AAAATTCAAA GCTGAAGTAT ACGTATTATC 
ACACTCCATT CTTCACTAAC TATCGCCCAC 
10 GACGTAACTG GTGTTGTAAA CTTACCAGAA 
TGGC 



GATGATTACA TTCCAACTCC 2 50 

GCCAGTTGAG GACGTATTCT 3 00 

GCCGTGTTGA ACGTGGTCAA 350 

GGTATGCACG AAACTTCTAA 4 00 

TAAATTATTA GACTACGCTG 450 

GTGGTGTTGC ACGTGAAGAC 500 

GGTTCTATTA CACCACACAC 5 50 

TAAAGATGAA GGTGGACGTC 600 

AATTCTATTT CCGTACTACT 650 

GGTACAGAAA TGGTTATGCC 7 00 

704 



15 2 ) INFORMATION FOR SEQ ID NO: 877 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 0 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 35984 



30 



35 



40 



45 



(xi) SEQUENCE DESCRIPTION: 



CTGGCGACAA CGTTGAAATG 



ID NO: 877 






TGCCACAAAC 


TCGTGAACAC 


50 


GCATTAGTTG 


TATTCTTAAA 


100 


ATTAGAATTA 


GTTGAAATGG 


150 


TCCCAGGTGA 


CGATGTACCT 


200 


GAAGGCGATG 


CTGAATACGA 


250 


TGATGATTAC 


ATTCCAACTC 


300 


TGCCAGTTGA 


GGACGTATTC 


350 


GGCCGTGTTG 


AACGTGGTCA 


400 


CGGTATGCAC 


GAAAC TTCT A 


450 


GTAAATTATT 


AGACTACGCT 


500 


CGTGGTGTTG 


CACGTGAAGA 


550 


TGGTTCTATT 


ACACCACACA 


600 


CTAAAGATGA 


AGGTGGACGT 


650 


CAATTCTATT 


TCCGTACTAC 


700 


AGGTACAGAA 


ATGGTTATGC 


750 






770 



50 2 ) INFORMATION FOR SEQ ID NO: 878 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 716 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



60 



(vi) ORIGINAL SOURCE: 

696 
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(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 35983 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87 8 

5 

TTGTATTCTT AAACAAAGTT GACATGGTAG ACGACGAAGA 
TTAGTTGAAA TGGAAGTTCG TG AC TT ATT A AGCGAATATG 
TGACGATGTA CCTGTAATCG CTGGTTCTGC ATTAAAAGCA 
ATGCTGAATA CGAACAAAAA ATCTTAGACT TAATGCAAGC 

10 TACATTCCAA CTCCAGAACG TGATTCTGAC AAACCATTCA 
TGAGGACGTA TTCTCAATCA CTGGTCGTGG TACTGTTGCT 
TTGAACGTGG TCAAATCAAA GTTGGTGAAG AAGTTGAAAT 
CACGAAACTT CTAAAACAAC TGTTACTGGT GTAGAAATGT 
ATTAGACTAC GCTGAAGCTG GTGACAACAT CGGTGCTTTA 

15 TTGCACGTGA AGACGTACAA CGTGGTCAAG TATTAGCTGC 
ATTACACCAC ACACAAAATT CAAAGCTGAA GTATACGTAT 
TGAAGGTGGA CGTCACACTC CATTCTTCAC TAACTATCGC 
ATTTCCGTAC TACTG AC GTA ACTGGTGTTG TAAACTT AC C 
GAAATGGTTA TGCCTGGCGA CAACGTTGAA ATGACAGTTG 

2 0 TCCAATCGCT ATCGAA 



ATTATTAGAA 50 

ACTTCCCAGG 100 

TTAGAAGGCG 150 

AGTTGATGAT 200 

TGATGC C AG T 2 50 

ACAGGCCGTG 3 00 

CATCGGTATG 35 0 

TCCGTAAATT 4 00 

TTACGTGGTG 4 50 

TCCTGGTTCT 500 

TATCTAAAGA 550 

CCACAATTCT 600 

AGAAGGTACA 650 

AATTAATCGC 7 00 
716 



25 



2 ) INFORMATION FOR SEQ ID NO: 879 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 640 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R764 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 879 

4 0 CGGTCCTATG CCTCAAACTC GTGAACACAT CTTGTTATCA CGTAACGTTG 5 0 

GCGTACCATA CATCGTTGTT TTCTTGAACA AAATGGATAT GGTTGATGAC 100 

GAAGAATTGC TAGAATTAGT TGAAATGGAA GTTCGTGACC TATTGTCTGA 150 

ATATGACTTC CCAGGCGACG ATGTTCCTGT AATCGCCGGT TCTGCTTTGA 2 00 

AAGC TCTTG A AGGAGATCCT TCATACGAAG AAAAAATCAT GGAATTGATG 250 

45 GCTGCAGTTG ACGAATACGT TCCAACTCCA GAACGTGATA CTGACAAACC 3 00 

ATTCATGATG CCAGTCGAAG ACGTATTCTC AATCACTGGA CGTGGTACTG 3 50 

TTGCTACAGG CCGTGTTGAA CGTGGACAAG TTCGCGTTGG TGATGAAGTA 400 

GAAATCGTTG GTATTGCTGA CGAAACTGCT AAAACAACTG TAACAGGTGT 450 

TGAAATGTTC CGTAAATTGT TAGACTATGC TGAAGCAGGG GATAACATTG 500 

50 GTGCATTGCT ACGTGGGGTT GCTCGTGAAG ACATCCAACG TGGACAAGTA 550 

TTGGC T AAAG CTGGTACAAT CACACCTCAT ACAAAATTCA AAGCTGAAGT 600 

TTATGTTTTG ACAAAAGAAG AAGGTGGACG TCACACTCCA 640 



55 

2 ) INFORMATION FOR SEQ ID NO : 880 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 831 bases 
60 (B) TYPE: Nucleic acid 

697 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 

(B) STRAIN: PAO-1 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 880 

rrrrrCGATC CTGGTTTGCT CGGCTGCCGA CGGCCCCATG CCGCAGACCC 50 

gSagcaSS CCTGCTGTCC CGCCAGGTAG GCGTTCCCTA CATCGTCGTG 100 

TTCCTGAACA AGGCCGACAT GGTCGACGAC GCCGAGCTGC TGGAACTGGT 150 

IS Sagatggaa GTTCGCGATC TGCTGAACAC CTACGACTTC CCGGGCGACG 200 

ACACTCCGAT cItCATCGGT TCCGCGCTGA TGGCGCTGGA AGGCAAGGAT 250 

GACAACGGCA TCGGCGTAAG CGCCGTGCAG AAGCTGGTAG AGACCCTGGA 300 

Sotacatt CCGGAGCCGG TTCGTGCCAT CGACCAGCCG TTCCTGATGC 350 

CGATCGAAGA CGTGTTCTCG ATCTCCGGCC GCGGTACCGT GGTAACCGGT 400 

2 0 St^AgIgC GCGGCATCAT CAAGGTCCAG GAAGAAGTGG AAATCGTCGG 450 

CATCAAGGCG ACCACCAAGA CTACCTGCAC CGGCGTTGAA ATGTTC CGC A 500 

AGC^TCGA CGAAGGTCGT GCTGGTGAGA ACGTTGGTAT CCTGCTGCGT 550 

G^CA^CAAGC GTGAAGACGT AGAGCGTGGC CAGGTTCTGG CCAAGCCGGG 600 

ScSaScaag CCGCACACCA AGTTCGAGTG CGAAGTGTAC GTGCTGTCCA 650 

0* AGGAAGAAGG TGGTCGTCAC ACCCCGTTCT TCAAGGGCTA CCGTCCGCAG 7 00 

TOtIcTTCC GTACCACCGA YGTGACCGGT AACTGCGAAC TGCCGGAAGG 750 

S5aSagItg GTAATGCCGG GCGACAACAT CAAGATGGTT GTCACCCTGA 800 

TCGCTCCGAT CGCCATGGAA GATGGCCTGC G OJJ - 

30 

2 ) INFORMATION FOR SEQ ID NO: 881 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 642 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseli flaws 

(B) STRAIN: R421 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 881 



CCTATGCCTC AAACACGTGA ACACATCTTG TTATCACGTA ACGTTGGTGT 50 

ACCATACATC GTTGTTTTCT TAAACAAAAT GGATATGGTT GATGACGAAG 100 

50 AATTACTAGA ATTAGTTGAA ATGGAAGTTC GTGACTTATT GTCAGAATAT 150 

GACTTCCCAG GCGACGATGT TCCTGTAATC GCTGGTTCTG CTTTGAAAGC 2 00 

TCTTGAAGGC GATGCTTCAT ACGAAGAAAA AATCATGGAA TTAATGGCTG 250 

CAGTTGACGA ATACGTTCCA ACTCCAGAAC GTGACACTGA CAAACCATTC 300 

ATGATGCCAG TCGAAGACGT ATTCTCAATC ACTGGACGTG GTACTGTTGC 3 50 

55 TACAGGCCGT GTTGAACGTG GACAAGTTCG CGTTGGTGAC GAAGTTGAAA 400 

TCGTTGGTAT TGCTGAAGAA ACTGCTAAAA CAACTGTAAC TGGTGTTGAA 450 

ATGTTCCGTA AATTGTTAGA CTATGCTGAA GCAGGGGATA ACATTGGTGC 500 

ATTGCTACGT GGTGTTGCTC GTGAAGACAT CCAACGTGGA CAAGTATTGG 550 

CTAAAGCTGG TACAATCACA CCTCATACAA AATTTAAAGC TGAAGTTTAC 600 

60 GTTTTAACAA AAGAAGAAGG TGGACGTCAC ACACCATTCT TC 642 

698 
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2 ) INFORMATION FOR SEQ ID NO: 882 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: R775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 882 

20 CCTATGCCTC AAACACGTGA ACACATCTTG TTATCACGTA ACGTTGGTGT 5 0 

ACCATACATC GTTGTTTTCT TAAACAAAAT GGATATGGTT GATGACGAAG 100 

AATTACTAGA ATTAGTTGAA ATGGAAGTTC GTGACTTATT GTC AG AAT AT 150 

GACTTCCCAG GCGACGATGT TCCTGTAATC GCTGGTTCTG CTTTGAAAGC 2 00 

TCTTGAAGGC GATGCTTCAT ACGAAGAAAA AATCATGGAA TTAATGGCTG 2 50 

2 5 CAGTTGACGA ATACGTTCCA ACTCCAGAAC GTGACACTGA CAAACCATTC 3 00 

ATGATGCCAG TCGAAGACGT ATTCTCAATC ACTGGACGTG GTACTGTTGC 350 

TACAGGCCGT GTTGAACGTG GACAAGTTCG CGTTGGTGAC GAAGTTGAAA 40 0 

TCGTTGGTAT TGCTGAAGAA ACTGCTAAAA CAACTGTAAC TGGTGTTGAA 450 

ATGTTCCGTA AATTGTTAGA CTATGCTGAA GCAGGGGATA ACATTGGTGC 500 

3 0 ATTGCTACGT GGTGTTGCTC GTGAAGACAT CCAACGTGGA CAAGTATTGG 55 0 

CTAAAGCTGG TACAATCACA CCTCATACAA AATTTAAAGC TGAAGTTTAC 600 

GTTTTAACAA AAGAAGAAGG TGGACGTCAT ACACCA 63 6 



35 



45 



2 ) INFORMATION FOR SEQ ID NO: 883 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 641 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: R422 

5 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 883 

GGTCCTATGC CTCAAACACG TGAACATATC TTATTATCAC GTAACGTTGG 5 0 

TGTACCATAC ATCGTTGTAT TCTTAAACAA AATGGATATG GTTGATGACG 100 

AAGAATTATT AGAATTAGTA GAAATGGAAG TTCGTGACTT ATTATCAGAA 15 0 

55 TACGATTTCC CAGGCGATGA TGTTCCAGTT ATCGCAGGTT CTGCTTTGAA 200 

AGCTTTAGAA GGCGACGAGT C TTATGAAG A AAAAATC TTA GAATTAATGG 25 0 

CTGCAGTTGA CGAATATATC CCAACTCCAG AACGTGATAC TGACAAACCA 300 

TTCATGATGC CAGTCGAAGA CGTATTCTCA ATCACTGGAC GTGGTACTGT 3 50 

TGCTACAGGC CGTGTTGAAC GTGGTGAAGT TCGCGTTGGT GACGAAGTTG 4 00 

60 AAATCGTTGG TATTAAAGAC GAAACATCTA AAACAACTGT TACAGGTGTT 45 0 

699 
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GAAATGTTCC GTAAATTATT AGACTACGCT 
TGC TTTATT A C GTGGTGT AG CACGTGAAGA 
TAGCTAAACC AGCTACAATC ACTCCACACA 
TACGTATTAT CAAAAGAAGA AGGCGGACGT 



GAAGCAGGCG ACAACATCGG 500 

TATCGAACGT GGACAAGTAT 550 

CAAAATTCAA AGCTGAAGTA 600 

CACACTCCAT T 641 



2 ) INFORMATION FOR SEQ ID NO: 884 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 640 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



15 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 
20 (B) STRAIN: R57 5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 884 

TATGCCTCAA ACACGTGAAC ATATCTTATT ATCACGTAAC GTTGGTGTAC 50 

25 CATACATCGT TGTATTCTTA AACAAAATGG ATATGGTTGA TGACGAAGAA 100 

TTATTAGAAT TAGTAGAAAT GGAAGTTCGT G AC TT ATT AT CAGAATACGA 15 0 

TTTCCCAGGC GATGATGTTC CAGTTATCGC AGGTTCTGCT TTGAAAGCTT 2 00 

TAGAAGGCGA CGAGTC TT AT GAAGAAAAAA TCTTAGAATT AATGGCTGCA 250 

GTTGACGAAT ATATCCCAAC TCCAGAACGT GATACTGACA AACCATTCAT 300 

30 GATGCCAGTC GAAG AC GT AT TCTCAATCAC TGGACGTGGT ACTGTTGCTA 350 

JTCAGGCCGTGT TGAACGTGGT "GAAGTTCGCG "TTGGTSACGA -AGTTGAAATC 400 

GTTGGTATTA AAGACGAAAC ATC TAAAAC A ACTGTTACAG GTGTTGAAAT 450 

GTTCCGTAAA TTATTAGACT ACGCTGAAGC AGGCGACAAC ATCGGTGCTT 500 

TATTAC GTGG TGTTGCACGT GAAGATATCG AAC GTGG AC A AGTATTAGCT 550 

35 AAACCAGCTA CAATCACTCC ACACACAAAA TTCAAAGCTG AAGTATACGT 600 

ATTATCAAAA GAAGAAGGCG GACGTCACAC TCCATTCTTC 640 



40 2 ) INFORMATION FOR SEQ ID NO : 885 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 632 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

<D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecivm 

(B) STRAIN: R492 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 885 



TGCCTCAAAC TCGTGAACAC ATCC TATTGT CTCGTCAAGT TGGTGTTCCT 50 

TACATCGTTG TATTC TTGAA CAAAGTAGAC ATGGTTGATG ACGAAGAATT 100 

ACTAGAATTA GTTGAAATGG AAGTTCGTGA C CT ATTAAC A GAATACGAAT 150 

TCCCTGGTGA CGATGTTCCT GTAGTTGCTG GATCAGCTTT GAAAGCTCTA 200 

60 GAAGGCGACG CTTCATACGA AGAAAAAATT CTTGAATTAA TGGCTGCAGT 250 

700 
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10 



25 



TGACGAATAC ATCCCAACTC CAGAACGTGA CAACGACAAA CCATTCATGA 300 

TGCCAGTTGA AGACGTGTTC TCAATTACTG GACGTGGTAC TGTTGCTACA 350 

GGTCGTGTTG AACGTGGACA AGTTCGCGTT GGTGACGAAG TTGAAGTTGT 4 00 

TGGTATTGCT GAAGAAACTT CAAAAACAAC AGTTACTGGT GTTGAAATGT 450 

TCCGTAAATT GTTAGACTAC GCTGAAGCTG GAGACAACAT TGGTGCTTTA 500 

CTACGTGGTG TTGCACGTGA AGACATCCAA CGTGGACAAG TTTTAGCTAA 550 

ACCAGGTACA ATCACACCTC ATACAAAATT CTCTGCAGAA GTATACGTGT 600 

TGACAAAAGA AGAAGGTGGA CGTCATACTC CA 632 



2 ) INFORMATION FOR SEQ ID NO: 886 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 64 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R57 6 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 886 



CGGCCCAATG CCTCAAACTC GTGAACACAT CCTATTGTCT CGTCAAGTTG 50 

GTGTTCCTTA CATCGTTGTA TTCTTGAACA AAGTAGACAT GGTTGATGAC 100 

3 0 GAAGAATTAC TAGAATTAGT TGAAATGGAA GTTCGTG AC C TATTAACAGA 15 0 

ATACGAATTC CCTGGTGACG ATGTTCCTGT AGTTGCTGGA TCAGCTTTGA 2 00 

AAGCTCTAGA AGGCGACGCT TCATACGAAG AAAAAATTCT TGAATTAATG 2 50 

GCTGCAGTTG ACGAATACAT CCCAACTCCA GAACGTGACA ACGACAAACC 3 00 

ATTCATGATG CCAGTTGAAG ACGTGTTCTC AATT AC TGG A CGTGGTACTG 35 0 

35 TTGCTACAGG TCGTGTTGAA CGTGGACAAG TTCGCGTTGG TGACGAAGTT 400 

GAAGTTGTTG GTATTGCTGA AGAAACTTCA AAAACAACAG TTACTGGTGT 450 

TGAAATGTTC CGTAAATTGT TAG AC T AC GC TGAAGCTGGA GACAACATTG 500 

GTGCTTTACT ACGTGGTGTT GCACGTGAAG ACATCCAACG TGGACAAGTT 550 

TTAGC TAAAC CAGGTACAAT CACACCTCAT ACAAAATTCT CTGCAGAAGT 600 

40 AT AC GTGTTG ACAAAAGAAG AAGGTGGACG TCATACTCCA 64 0 



2 ) INFORMATION FOR SEQ ID NO: 887 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Zoogloea ramigera 

(B) STRAIN: ATCC 25935 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 887 

60 ATCCTGGTTT GCTCCGCAGC TGACGGCCCA ATGCCACAGA CCCGCGAGCA 50 

701 
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CATCCTGCTG GCCCGCCAAG TTGGCGTTCC 
ACAAGTGCGA CCTGGTTGAC GACGCAGAAC 
GAAGTGCGTG AATTGCTGTC GAAATACGAG 
AATCATCAAG GGTTCGGCAC GTATGGCGCT 
5 TGGGCGTTGA CGCCATCATG CGTCTGGCCG 
CCTACGCCAG AGCGCGCAGT CGATGGCGCC 
CGTGTTCTCG ATCTCGGGTC GCGGTACCGT 
GCGGCGTGAT CAAGGTCGGC GAAGAGATCG 
ACCGTCAAAA CCACTTGCAC CGGCGTGGAA 

10 CCAGGGTCAA GCCGGCGACA ACGTTGGTCT 
GTGAAGACGT ACAGCGTGGT CAGGTTCTGG 
CCGCACAACC ACTTCACCGG CGAGATCTAC 
CGGCCGTCAC ACCCCGTTCT TCAACAACTA 
GTACGACTGA CGTGACCGGT TCGATCGAAC 

15 GTCATGCCAG GCGACAACGT GTCGATCACC 
CGCGAT 



ATACATCATC GTGTTCCTGA 100 

TGCTGGAACT GGTCGAAATG 150 

TTCCCAGGCG ACGACGTACC 2 00 

GGAAGGCAAA GAAGGCGAGA 250 

ATGCACTGGA CAGCTACATC 3 00 

TTCCTGATGC CAGTGGAAGA 35 0 

TGTGACCGGT CGTATCGAGC 400 

AAATCGTCGG CATTATCGAC 450 

ATGTTCCGCA AGCTGCTGGA 500 

GCTGCTGCGC GGCACCAAGC 550 

CCAAGCCAGC GTCGATCAAG 600 

GTTCTGTCGA AAGATGAAGG 650 

TCGTCCACAG TTCTACTTCC 7 00 

TGCCAGCAGA CAAAGAAATG 7 50 

GTCAAGCTGA TCAACCCGAT 800 

806 



20 2 ) INFORMATION FOR SEQ ID NO: 888 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 634 bases 

(B) TYPE: Nucleic acid 
25 ( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: R503 



35 



40 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 



ID NO: 888 






ATCACGTAAC 


GTTGGTGTAC 


50 


ATATGGTTGA 


TGACGAAGAA 


100 


GACTTATTAT 


CAGAATACGA 


150 


AGGTTCTGCT 


TTGAAAGCTT 


200 


TCTTAGAATT 


AATGGCTGCA 


250 


GATACTGACA 


AACCATTCAT 


300 


TGGACGTGGT 


AC TGTTGC T A 


350 


TTGGTGACGA 


AGTTGAAATC 


400 


AC TGTTAC AG 


GTGTTGAAAT 


450 


AGGCGACAAC 


ATCGGTGCTT 


500 


AACGTGGACA 


AGTATTAGCT 


550 


TTCAAAGCTG 


AAGTATACGT 


600 


TCCA 




634 



2 ) INFORMATION FOR SEQ ID NO: 889 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 493 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

702 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: ATCC 14110 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 889 

TGTCTTCATC CAGGAATTGA TTGTGAGTCG TTCCACATGC TCACCTAGTT 50 

TTCGC TCG AT CTTTTCACTA ACGCAAACCA TGTAGAACAA CATTGCCAAG 100 

10 GCCCACGGTG GTTACTCCGT CTTCACTGGT GTTGGTGAGC GTACTCGTGA 15 0 

GGGTAACGAT CTGTACCACG AAATGCAGGA GACTGGTGTC ATTCAGCTCG 20 0 

AGGGTGAATC CAAGGTCGCA CTGGTGTTCG GACAGATGAA CGAGCCCCCC 25 0 

GGTGCCCGTG CCCGTGTCGC CCTTACCGGT CTGACCATTG CCGAGTACTT 300 

CCGTGACGAG GAGGGTCAGG ACGTGCTGCT CTTCATTGAC AACATTTTCC 350 

15 GTTTCACCCA GGCCGGTTCT GAGGTGTCTG CCCTTCTCGG TCGTATCCCC 400 

TCTGCCGTCG GTTACCAGCC CACCCTGGCC GTCGACATGG GTGGTATGCA 45 0 

GGAGCGTATC ACCACCACCA AGAAGGGTTC TATTACCTCC GTC 493 

20 

(i) SEQUENCE CHARACTERISTICS: 890 

(A) LENGTH: 466 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

2 5 (D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

3 0 (A) ORGANISM: Peni ci 1 1 i wn marneffei 

(B) "STRAIN: ATCC 64101 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 890 

35 TTGATTGTAC GTCTTTACCT TTCTGCCTGA C TGTTTACG A CAACTAACGA 5 0 

AAGC GTAGAA CAACATTGCC AAGGCTC AC G GTGGTTACTC TGTCTTCACT 100 

GGTGTCGGTG AACGTACTCG TGAGGGTAAC GATTTGTACC ACGAAATGCA 150 

GGAAACTGGT GTCATTCAGC TCGAGGGTGA ATCCAAGGTC GCCCTCGTGT 200 

TCGGTCAGAT GAACGAGCCC CCCGGTGCCC GTGCCCGTGT CGCTCTTACT 25 0 

4 0 GGTTTGACCA TTGCCGAGTA CTTCCGTGAC GAGGAAGGTC AGGACGTGCT 300 

TCTCTTCATT GACAACATTT TCCGTTTCAC TCAGGCCGGT TCTGAGGTGT 350 

CTGCCCTTCT GGGTCGTATC CCCTCTGCCG TCGGTTACCA GCCCACCCTT 400 

GC CGTCGAC A TGGGTATCAT GCAGGAGCGT ATTACCACCA CCACCAAGGG 45 0 

TTCCATCACC TCCGTC 46 6 

45 

2 ) INFORMATION FOR SEQ ID NO: 891 

5 0 (i) SEQUENCE CHARACTERISTICS: 

.(A) LENGTH: 478 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paecilomyces lilacinus 
60 (B) STRAIN: ATCC 42570 

703 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 891 

AGGAGCTGAT CAACAACATC GCCAAGGCTC ACGGTGGTTA CTCCGTCTTC 50 

5 ac?gStc?cg GTGAGCGTAC CCGTGAGGGT AACGATCTGT ACCACGAAAT 100 

GCAGGAGACC TCGGTCATTC AGCTCGAGGG CGAGTCTAAG GTGGCCCTGG 150 

?S??Sgtca GATGAACGAG CCCCCGGGTG CTCGTGCCCG TGTCGCTCTT 200 

Ic?3g?Stta CCGTCGCCGA GTACTTCCGT GACCAGGAGG GTCAGGATGG 2 50 

otJ^tog TCCACTCATG CCGAAACATG TGCGTGTTCC GAGGCTAATC 300 

10 ScG?GCCAG TGCTGCTTTT CATCGACAAC ATTTTCCGAT TCACACAGGC 350 

CGGTTCCGAG GTGTCTGCCC TGCTGGGTCG TATCCCCTCT GCCGTCGGTT 400 

aScagcccac CCTCGCCGTC GACATGGGTG GCATGCAGGA GCGTATCACC 450 
ACCACCAAGA AGGGCTCTAT CACCTCCG 



15 



2 ) INFORMATION FOR SEQ ID NO: 892 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 481 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

2 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffex 

(B) STRAIN: ATCC 58950 

30 (xi) " SEQUENCE— DESCRIPTION : —SEQ —ID— NO : - 892 

GTCTTTATCC AGGAGTTGAT TGTACGTCTT TACCTTTCTG CCTGACTGTT 50 

TACGACAACT AACGAAAGCG TAGAACAACA TTGCCAAGGC TCACGGTGGT 100 

2 5 TACTCTGTCT TC AC TGGTGT CGGTGAACGT ACTCGTGAGG GTAACGATTT 150 

GTACCACGAA ATGCAGGAAA CTGGTGTCAT TCAGCTCGAG GGTGAATCCA 200 

AGGTCGCCCT CGTGTTCGGT CAGATGAACG AGCCCCCCGG TGCCCGTGCC 250 

CGTGTCGCTC TTAC TGGTTT GACCATTGCC GAGTACTTCC GTGACGAGGA 300 

AGGTCAGGAC GTGCTTCTCT TCATTGACAA CATTTTCCGT TTCACTCAGG 350 

40 CCGGTTCTGA GGTGTCTGCC CTTCTGGGTC GTATCCCCTC TGCCGTCGGT 400 

TACCAGCCCA CCCTTGCCGT CGACATGGGT ATCATGCAGG AGCGTATTAC 450 

CACCACCACC AAGGGTTCCA TCACCTCCGT C 48 1 

45 



2 ) INFORMATION FOR SEQ ID NO : 893 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 08 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



60 



<ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 

(B) STRAIN: ATCC 14285 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 893 
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CTCGTCCGTG GTGCTAAGGC C AC TG AC AC T GGTTSCCCCA TTACCATCCS 50 

CGTCGGCCCC GGTACCCTCG GTCGCATCAT GAACGTCACC GGTGACCCGA 100 

TCGACGAGCG CGGTCCCATC AAGACCGACA AGTTCCGTCC CATCCACGCT 15 0 

5 GAGGCTCCCG AGTTCGTTGA CCAGTCGACC ACCGCTGAGG TTCTCGTGAC 2 00 

TGGTATCAAG GTCGTCGATC TGCTTGCTCC CTACGCCCGT GGTGGTAAGA 2 50 

TTGGTCTGTT TGGCGGTGCC GGTGTTGGCA AGACCGTGTT CATCCAGGAG 3 00 

CTCATCAACA ACATCGCCAA GGCCCACGGT GGTTACTCCG TCTTCACCGG 3 50 

TGTCGGCGAG CGTACCCGTG AGGGTAACGA TCTGTACCAC GAAATGCAGG 400 

10 AGACCTCTGT CATTCAGCTT GACGGTGACT CCAAGGTCGC CCTGGTGTTC 450 

GGTCAGATGA ACGAGCCCCC TGGTGCTCGT GCCCGTGTCG CCTTGACCGG 500 

TTTGACTGTC GCTGAGTACT TCCGTGACGA GGAGGGCCAG GATGGTATGT 550 

TTTGAATTAT TTCCTTGTCG TACAGTTCCA AATCGAAGAA TTACTAACTT 600 

GTCAGTGCTT CTCTTCATCG ACAACATTTT CCGCTTCACC CAGGCCGGTT 65 0 

15 CTGAGGTGTC TGCCCTTCTG GGTCGTATTC CCTCCGCTGT CGGTTACCAG 7 00 

CCCACGCTCG CCGTGGACAT GGGTCTGATG CAGGAGCGTA TTACCACCAC 75 0 

CCGCAAGGGC TCAATTACCT CCGTCCAGGC CGTCTACGTG CCCGCTGACG 800 

ATCTGACGGA TCCCGCCCCC GCCACCACCT TCGCCCATCT GGACGCCACC 850 

AC TGTGC TGT CCCGAGGTAT CTCTGAGCTG GGTATCTACC CCGCTGTCGA 9 00 

20 CCCCCTCGAC TCCAAGTCGC GTATGCTGGA CCCCCGTATT GTCGGTGACG 950 

ACCACTACGA GACCGCCACT CGCGTCCAGC AGATCCTCCA AGAGTACAAG 1000 

TCGCTGCAGG ACATCATCGC CATTCTGGGT ATGGACGAGC TGTCTGAGGC 1050 

CGACAAGCTT ACAGTCGAGC GTGCTCGTAA GATCCAGCGT TTCCTGAGCC 1100 

AGCCGTTCAC GGTCGCGCAG GTCTTCACTG GTATCGAAGG CCAGCTGGTC 1150 

25 GATCTGAAGG ACACTATCGC TTCGTTCAAG GCTATCCTGA GCGGTGAGGG 1200 

TGACAGCC 1208 



30 2) INFORMATION FOR SEQ ID NO: 894 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malbranchea filamentosa 

(B) STRAIN: ATCC 48174 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 894 



TTCAGGAACT TATTGTAAGC CGCCCTCTTT ATGCATTGAG GGTGAATAAG 50 

AAGGCTGACA GGTAATAGAA CAACATTGCC AAGGCTCACG GTGGTT AC TC 100 

C GTGTTC AC T GGTGTCGGTG AGCGTACCCG TGAAGGAAAC GATCTGTACC 150 

ATGAAATGCA GGAAACCCGC GTCATCCAGC TTGATGGCGA GTCTAAGGTC 200 

50 GCCCTTGTGT TCGGTCAGAT GAACGAGCCC CCTGGAGCCC GTGCCCGTGT 250 

CGCTCTTACT GGTCTTACCG TTGCCGAATA CTTCCGTGAC GAGGAGGGCC 300 

AAGATGGTAC GCCTTTTTAC TCTTCTTATT CTTCGGGTCG GACTACAGAA 350 

CTAACCTGCT CCAGTGCTTC TCTTCATTGA TAACATTTTC CGTTTCACAC 400 

AAGCCGGTTC TGAGGTGTCT GCCTTGCTTG GACGTATTCC CTCTGCCGTC 450 

55 GGTTACCAGC CCACTCTCGC CGTCGACATG GGTGGTATGC AGGAACGTAT 500 

CACAACCACC AACAAGGGTT CCATTACTTC CGTG 534 



60 2 ) INFORMATION FOR SEQ ID NO : 895 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 448 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paecilomyces lilacinus 

(B) STRAIN: ATCC 60735 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 895 

15 



CAAGGCTCAC GGTGGTTACT CCGTCTTCAC TGGTGTCGGT GAGCGTACCC 5 0 

GTGAGGGTAA CGATCTGTAC CACGAAATGC AGGAGACCTC GGTCATTCAG 100 

CTCGAGGGCG AGTCTAAGGT GGCCCTGGTC TTTGGTCAGA TGAACGAGCC 150 

CCCGGGTGCT CGTGCCCGTG TCGCTCTTAC TGGTCTTACC GTCGCCGAGT 200 

2 0 ACTTCCGTGA CCAGGAGGGT CAGGATGGTT AGTTCTCGTC CACTCATGCC 250 

GAAACATGTG CGTGTTCCGA GGCTAATCAA CGTGCCAGTG CTGCTTTTCA 3 00 

TCGACAACAT TTTCCGATTC ACCCAGGCCG GTTCCGAGGT GTCTGCCCTG 350 

CTGGGTCGTA TCCCCTCTGC CGTC GGTT AC CAGCCCACCC TCGCCGTCGA 4 00 

CATGGGTGGC ATGCAGGAGC GTATCACCAC CACCAAGAAG GGCTCTAT 448 

25 



2) INFORMATION FOR SEQ ID NO : 896 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 483 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



35 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus nlger 
40 (B) STRAIN: ATCC 9508 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 896 

ATTCAGGAGC TGATTGTAAG TTGCCAATCC ATGAACTGGA GATTTGGTGT 50 

45 GACCCATAGA ACTAACAAAT TATTTAGAAC AACATCGCCA AGGCTCACGG 100 

TGGTTACTCC GTCTTCTGTG GTGTCGGTGA GCGTACTCGT GAGGGTAACG 150 

ATCTGTAC C A CGAAATGCAG GAGACTGGTG TCATCCAGCT CGAGGGTGAC 2 00 

TCCAAGGTCG CTCTGGTCTT CGGTCAGATG AACGAGCCCC CGGGTGCCCG 250 

TGCCCGTGTC GCCCTTACCG GTCTG AC CAT TGCCGAGTAC TTCCGTGACG 300 

50 AGGAGGGTCA GGACGTGCTG CTCTTCATTG ACAACATTTT CCGTTTCACC 350 

CAGGCCGGTT CTGAGGTGTC TGCC CTTCTG GGTCGTATCC CCTCTGCCGT 400 

CGGTTACCAG CCCACTCTGG C CGTCG AC AT GGGTGGTATG CAGGAGCGTA 450 

TTACCACCAC CACCAAGGGT TCCATTACCT CCG 483 



55 



2) INFORMATION FOR SEQ ID NO: 897 



(i) SEQUENCE CHARACTERISTICS: 
$0 (A) LENGTH: 1124 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN : ATCC 14110 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 897 

AAGCTCAAGT CCGAGCGTGA GCGTGGTATC ACCATCGACA TTGCCCTCTG 50 

GAAGTTCCAG ACTCCCAAGT ATGAGGTCAC TGTCATCGGT AAGCTCGACT 100 

15 CGCCCCGATA TGTTTTGGTG CTGTAGCTAA CACGATCTGA AGATGCCCCC 150 

GGTCACCGTG ACTTCATCAA GAACATGATC ACTGGTACCT CCCAGGCTGA 2 00 

CTGCGCTATC CTCATCATTG CCTCCGGTAC TGGTGAGTTC GAGGCTGGTA 2 50 

TCTCCAAGGA TGGCCAGACC CGTGAGCACG CTCTGCTGGC TTTCACCCTC 300 

GGTGTCAAGC AGCTCATCGT CGCCCTCAAC AAGATGGACA CCTGCAAGTG 3 50 

2 0 GTCCGAGGAT CGTTACAACG AAATTGTCAA GGAAACCTCC AACTTCATCA 400 

AGAAGGTCGG CTACAACCCC AAGGC CGTTC CCTTCGTCCC CATCTCTGGC 450 

TTCAACGGTG ACAACATGCT TGAGCCCTCC TCCAACTGCC CCTGGTACAA 500 

GGGATGGGAG AAGG AG AC C A AGGCCGGCAA GGTCACTGGT AAGACCCTCA 550 

TCGAGGCCAT CGACGCCATT GAGCCCCCTG TCCGTCCCTY CAACAAGCCC 600 

25 CTCCGTCTTC CCCTCCAGGA TGTCTACAAG ATCTCTGGTA TCGGAACGGT 650 

CCCTGTCGGC CGTGTCGAGA CCGGTATCAT CAAGCCCGGC ATGGTCGTCA 7 00 

CCTTCGCCCC CGCCAACGTC ACCACTGAAG TCAAGTCCGT CGAGATGCAC 750 

CACCAGCAGC TCCAGGAGGG TGTCCCCGGT GACAACGTCG GTTTCAACGT 800 

CAAGAACGTT TCCGTCAAGG AAGTCCGCCG TGGTAACGTC TGCGGTGACT 850 

3 0 CCAAGAACGA TCCCCCTCAG GGTGCTGCCT CCTTCAACGC CCAGGTCATC 900 

GTCCTCAACC ACCCCGGTCA GGTCGGCGCT GGTTACGCCC CCGTCCTCGA 950 

CTGCCACACT GCCCACATTG CTTGCAAGTT CTCTGAGCTG CTTGAGAAGA 1000 

TTGACCGCCG TACCGGCAAG TCTGTTGAGA ACAACCCCAA GTTCATCAAG 1050 

TCCGGTGATG CCGCCATCGT GAAGATGGTT CCTTCCAAGC CCATGTGTGT 1100 

3 5 CGAGTCCTTC ACTGACTACC CCCC 1124 



2 ) INFORMATION FOR SEQ ID NO: 898 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 1363 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
4 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Penicillium mameffei 

(D) STRAIN: WSA-214 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 898 

55 CTCAAGGCTG AGCGTGAGCG TGGTATCACC ATCGATATTG CTCTCTGGAA 50 

GTTCCAGACT GCCAAGTACG AGGTTACCGT CATTGACGCC CCCGGTCACC 100 

GTGATTTCAT CAAGAACATG ATCACTGGTA CCTCCCAGGC CGATTGCGCT 150 

ATTCTCATCA TTGCCTCTGG T AC TGGTGAA TTCGAGGCTG GTATCTCCAA 200 

GGATGGCCAG ACTCGTGAGC ACGCTCTTTT GGCTTTCACC CTCGGTGTCC 250 

60 GTCAGCTCAT TGTTGCCCTC AACAAGATGG ACACTTGCAA GTGGTCTCAG 300 

707 
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GGTGAGTACT CGTACCTGCG TTTGGCCTTG AATATCTTAC TAATGCACCA 350 

TAGATCGTTA CAACGAAATT GTCAAGGAGA CTTCCAACTT CATCAAGAAG 400 

GTCGGATACA ACCCTAAGAA CGTTCCTTTC GTTCCTATCT CCGGTTTCAA 450 

SSgaSXc ATGCTTGAGC CCTCCCCCAA CTGCCCCTGG TACAAGGGTT 500 

5 GGGaSSSa SSaIgg2c GGTAAGGTCA CTGGTAAGAC CCTCCTCGAG 550 

GCCATCGACG CCATTGAGCC CCCTACCCGT CCCGCCAACA AGGTTAGTCC 600 

CTCC?CGAC? ACTCAAACCC TCCTCATAAG TTCATGATTA CGACTCGTTC 650 

ACAGCCCCTC CGTCTTCCCC TCCAGGATGT CTACAAGATC GGTGGTATTG 7 00 

GAACGGTTCC CGTCGGTCGT GTTGAGACTG GTACCATCGT TCCTGGTATG 750 

10 GTTGTCACCT TGTAAGTCAC TCTCCTCGCT TATCCTACCT GAAATCATCA 800 

10 TGTGCTAACT TGACACTCAG CGCTCCCGCC AACGTCACCA CTGAAGTCAA 850 

GAGTGTTGAA ATGCACCACC AGCAGCTCAC TGCCGGTCAG CCCGGTGACA 900 

ACGTTGGTTT CAACGTGAAG AACGTCTCCG TCAAGGAAAT CCGTCGTGGT 950 

AACGTTGCTG GTGACAGCAA GAACGACCCC CCTGCCGGTG CTGCCTCCTT 1000 

15 CAACGCCCAG GTCATCGTCC TCAACCACCC CGGTCAGGTC GGTGCTGGTT 1050 

ACGCCCCAGT CCTCGATTGC CACACTGCCC ACATTGCTTG CAAGTTCGCT 1100 

GAGCTCCTCG AGAAGATTGA CCGTCGTACC GGAAAGTCTG TTGAGGACCA 115 0 

CCCCAAGTTC ATCAAGTCCG GTGACGC TGC CATCGTCAAG ATGATTCCTT 1200 

CCAAGCCCAT GTGTGTTGAG GCTTTCACCG AGTACCCTCC TCTCGGTCGT 1250 

20 TTCGCCGTTC GCGAGTAAGT TTTATCTCCG TTGTCTATTT TCCATCCTTC 1300 

CCTTCTCCTC CGTCTTCCAT ATATATTTTT TCAGTTATAT GTGACTAACC 1350 
ACAAATCACG GGA 



25 



2 ) INFORMATION FOR SEQ ID NO: 899 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1147 bases 
30 (B) TYPE: Nucleicacid 

" " — fcl STRARDEDNESS ": — Double 
(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Piedraia hortai 

(B) STRAIN: ATCC 24292 



40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 899 

CAAGCTGAAA GCCGAGCGTG AGCGTGGTAT CACTATCGAC ATTGCCCTCT 5 0 

GGAAGTTCGA GACTCCCAAG TACTATGTCA CCGTCATTGG TACGTCGCAT 100 

TATCTCACTC CTCACAGAAG CACGCTCCTA ACATCACACA GACGCTCCCG 150 

45 GTCACCGTGA TTTCATCAAG AACATGATCA CTGGTACCTC CCAGGCCGAC 200 

TGCGCTATCC TCATTATCGC TGCCGGTACT GGTGAGTTCG AGGCTGGTAT 250 

CTCCAAGGAT GGCCAGACTC GTGAGCACGC CCTGCTCGCC TACACCCTCG 300 

GTGTGCGTCA GCTCATCGTC GCCATCAACA AGATGGACAC CACCAAGTGG 3 50 

TCTGAGGCCC GTTACCAGGA GATCATCAAG GAGACCTCCA ACTTCATCAA 400 

50 GAAGGTCGGC TACAACCCCA AGACCGTCGC TTTCGTCCCC ATCTCTGGCT 450 

TCAACGGCGA CAACATGCTT GCCCCCTCCA CCAACTGCCC CTGGTACAAG 500 

GGATGGGAGC GTGAGGTCAA GGGCAACAAG CAGACCGGCA AGACCCTCCT 550 

CGAGGCCATT GACGGCATTG AGCCCCCCAA GCGTCCTTCC GACAAGCCCC 600 

TCCGTCTTCC TCTCCAGGAT GTTTACAAGA TCGGTGGTAT CGGAACTGTT 650 

55 CCTGTCGGCC GTATCGAGAC TGGTGTCCTC AAGCCCGGTA TGGTCGTTAC 700 

CTTCGCTCCC GCCAACGTCA CCACTGAAGT CAAGTCCGTC GAGATGC AC C 7 50 

ACGAGCAGCT CACTGAGGGT CTTCCCGGTG ACAACGTCGG TTTCAACGTG 800 

AAGAACGTTT CCGTCAAGGA CATCCGCCGT GGTAACGTTG CCAGTGACTC 850 

CAAGAACGAC CCCGCTCTGG GTGCCGCTTC TTTCGACGCC CAGGTCATCG 900 

60 TCCTCAACCA CCCCGGTCAG GTCGGTGCTG GTTACGCCCC GGTCCTCGAT 950 

708 
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TGCCACACTG CCCACATTGC TTGCAAGTTC 
CGACCGCCGT ACCGGCAAGT CTGTTGAGGA 
CTGGTGACTC TGCCATCGTC AAGATGGTTC 
GAGGCTTTCA CCGACTACCC TCCTCTGGGC 



TCCGAGATCA AGGAGAAGAT 1000 

CGCCCCCAAG TTCATCAAGT 1050 

CCTCCAAGCC CATGTGCGTT 1100 

CGTTTCGCCG TCCGTGA 1147 



2 ) INFORMATION FOR SEQ ID NO: 900 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paecilomyces lilacinus 
20 (B) STRAIN: ATCC 60735 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 900 

CTCAAGGCCG AGCGTGAGCG TGGTATCACC ATCGACATTG CCCTCTGGAA 50 

25 GTTCGAGACT CCCAAGTACT ATGTCACCGT CATTGGTACG TCGACTCGCG 100 

C GAG ACTGGT CGCAATTTCC ACGTCGCTAA CGTGCTTGAA CAGACGCTCC 15 0 

CGGCCACCGT GACTTCATCA AGAACATGAT C ACTGGT AC C TCCCAGGCTG 2 00 

ACTGCGCTAT CCTCATTATC GCTGCCGGCA CTGGTGAGTT CGAGGCTGGT 250 

ATCTCCAAGG ATGGC C AG AC CCGTGAGCAC GCTCTGCTCG CCTACACCCT 300 

30. . C GGTGTT AAG CAGCTCATCG TCGCTATCAA CAAGATGGAC ACCACCAAGT 350 

GGTCTGAGGC CCGTTTCCAG GAGATCATCA AGGAGACCTC CAACTTCATC 400 

AAGAAGGTCG GCTACAACCC CAAGACCGTC GCTTTCGTCC CCATCTCTGG 450 

TTTCCACGGC GACAACATGC TTTCCCCCTC CACCAACTGC CCCTGGTACA 500 

AGGGCTGGGA GAAGGAGACC AAGGC TGGC A AGTCCACCGG CAAGACCCTC 550 

3 5 CTTGAGGCCA TCGACTCCAT CGAGCCCCCC AAGCGCCCCA GCGACAAGCC 600 

CCTCCGCCTT CCCCTTCAGG ATGTGTACAA GATCGGCGGT ATCGGCACAG 650 

TCCCTGTCGG CCGTATCGAG ACTGGTGTCA TCAAGCCCGG CATGGTCGTG 7 00 

ACCTTCGCTC CTTCCAACGT CACCACCGAA GTCAAGTCCG TTGAGATGCA 7 50 

C C ACGAGC AG CTCTCCGAGG GTGTCCCCGG TGACAACGTC GGCTTCAACG 800 

4 0 TCAAGAACGT CTCCGTCAAG GAGATCCGTC GTGGCAACGT CGCCGGTGAC 850 

TCCAAGAACG ACCCCCCTCT GGGTGCCGCT TCTTTCGATG CCCAGGTCAT 900 

CGTCCTCAAC CACCCCGGCC AGGTCGGTGC TGGCTACGCC CCCGTCCTCG 950 

ACTGCCACAC CGCCCACATT GCCTGCAAGT TCGC CGAG AT CAAGGAGAAG 1000 

ATCGACCGCC GTACCGGCAA GTCTGTCGAG TCCGCCCCCA AGTTCATCAA 1050 

45 GTCTGGCGAC TCTGCCATCG TCAAGATGAT TCCCTCCAAG CCCATGTGCG 1100 

TTGAGGCTTT CACCGACTAC CCTCCTCTGG GCCGCTTCGC CGTCCGTGAC 1150 



50 2 ) INFORMATION FOR SEQ ID NO: 901 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

709 
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(A) ORGANISM: Paracoccidioides brasiliensis 

(B) STRAIN: ATCC 32075 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 901 

5 TACCACTAAG TGGTCCGAGA CCCGATTCAA TGAAATTATC AAGGAAGTCA 50 

CCAATTTCAT TAAGAAGGTC GGATATAACC CCAAGACTGT TCCTTTCGTT 100 

CCCATTTCTG GTTTCGAGGG TGACAACATG ATCGAGCCCT CTGCCAACTG 150 

CCCATGGTAC AAGGGCTGGT CCAAGGAGAC TGCTCAGGGC AAGTACTCTG 2 00 

10 GCAAGACCCT TCTTGAGGCC ATCGACGCCA TTGAGCCCCC CACCCGTCCT 250 

ACCGATAAAC CTCTCCGTCT TCCCCTCCAG GATGTCTACA AG ATC TCCGG 300 

TATTGGCACT GTTCCTGTCG GACGTGTTGA GACTGGAGTC ATCAAGCCCG 350 

GTATGGTCGT GACCTTCGCT CCCGCCAACG TCACCACTGA AGTCAAGTCC 400 

GTCGAAATGC ACCACCAGCA GCTTACCGCC GGTAACCCCG GTGACAACGT 450 

15 CGGCTTCAAC GTCAAGAATG TTTCCGTCAA AGAAGTCCGC CGTGGTAACG 500 

TTGCCGGTGA CTCTAAGAAT GATCCCCCAA AGGGCTGCGA TTCCTTCAAT 550 

GCCCAGGTCA TCGTCCTCAA CCACCCTGGT CAGGTTGGCG CTGGTTATGC 600 

CCCAGTCCTC GACTGCCATA CTGCCCACAT TGCCTGCAAA TTCGCTGAGC 650 

TCCTTGAGAA GATTGATCGA CGAACCGGAA AGTCTGTTGA GAACAACCCC 700 

2 0 AAGTTCATCA AGTCCGGTGA TGCTGCTATC GTCAAGATGA TTCCTTCCAA 750 
G 



25 2) INFORMATION FOR SEQ ID NO: 902 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1056 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 

(B) STRAIN: ATCC 7968 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 902 

40 



CGTGAGCGCG GT ATC AC CAT C GAT ATTGCT CTGTGGAAGT TCGAGACCCC 50 

CAAGTACTAC GTCACCGTCA TTGACGCCCC CGGTCATCGC GATTTCATCA 100 

AGAACATGAT CACTGGTACC TCGCAGGCCG ACTGCGCCAT TCTCATCATT 150 

GCCGCTGGTA CTGGTGAGTT CGAGGCTGGT ATCTCCAAGG ATGGCCAGAC 200 

45 TCGTGAGCAC GCTCTGCTCG CCTACACCCT GGGTGTGCGG CAGCTGATCG 250 

TCGCCATCAA CAAGATGGAC ACGGCCAAGT GGGCTGAGGC TCGTTACCAG 3 00 

GAGATCATCA AGGAGACCTC CAACTTCATC AAGAAGGTCG GCTACAACCC 350 

CAAGACTGTT GCCTTCGTCC CCATCTCGGG CTTCCACGGC GACAACATGC 400 

TTACTCCCTC GACCAACTGC CCCTGGTACA AGGGCTGGGA GAAGGAGGGC 450 

50 AAGAGCGGCA AGGTTACCGG TAAGACTCTG CTGGACGCCA TTGACGCCGT 500 

CGAGCCCCCC AAGCGCCCCA CGGACAAGCC CCTGCGTCTG CCCCTCCAGG 550 

ATGTCTACAA GATCGGCGGT ATCGGCACTG TCCCTGTCGG CCGTATCGAG 600 

ACTGGTGTCC TGAAGCCCGG CATGGTCGTC ACCTTTGCCC CGTCCAACGT 650 

CACCACTGAA GTCAAGTCCG TCGAGATGCA CCACGAGCAG CTTGTTGAGG 700 

55 GTGTTCCCGG CGACAACGTC GGCTTCAACG TCAAGAACGT CTCCGTCAAG 750 

GAG ATC CGTC GTGGCAACGT TGCCGGTGAC TCCAAGAACG ACCCCCCCTC 800 

GGGCGCCGCC ACCTTCAACG CCCAGGTCAT TGTCCTGAAC CACCCCGGCC 850 

AGGTCGGCAA CGGCTACGCC CCGGTTCTGG ACTGCCACAC CGCCCACATT 900 

GCCTGCAAGT TCACCGAGAT CCTTGAGAAG ATCGACCGCC GTACCGGCAA 950 

60 GTCGGTTGAG AACAACCCCA AGTTCATCAA GTCGGGTGAC GCCGCCATTG 1000 

710 
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15 



TCAAGCTGAC GCCCTCGAAG CCCATGTGCG TTGAGGCCTT CACTGACTAC 1050 
CCCCCT 1056 



2 ) INFORMATION FOR SEQ ID NO: 903 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 13 66 bases 
10 (B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicilli urn marneffei 

(B) STRAIN: ATCC 58950 

2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 903 

CAAGGCTGAG CGTGAGCGTG GT ATC AC CAT CGATATTGCT CTCTGGAAGT 5 0 

TCCAGACTGC CAAGTACGAG GTTACCGTCA TTGACGCCCC CGGTCACCGT 100 

GATTTCATCA AGAACATGAT CACTGGTACC TCCCAGGCCG ATTGCGCTAT 150 

25 TCTCATCATT GCCTCTGGTA CTGGTGAATT CGAGGCTGGT ATCTCCAAGG 2 00 

ATGGCCAGAC TCGTGAGCAC GCTCTTTTGG CTTTCACCCT CGGTGTCCGT 250 

CAGCTCATTG TTGCCCTCAA CAAGATGGAC ACTTGCAAGT GGTCTCAGGG 300 

TGAGTAC TCG TACCTGCGTT TGGCCTTGAA TATCTTACTA ATGCACCATA 350 

GATCGTTACA AC GAAATTGT CAAGGAGACT TCCAACTTCA TCAAGAAGGT 400 

3 0 CGGATACAAC CCTAAGAACG TT.CCTTTCGT TCCTATCTCC GGTTTCAACG . 45 0 

-GTGACAACAT GCTTGAGCCC TCCCCCAACT GCCCCTGGTA CAAGGGTTGG '500 

GAGAAGGAGA CCAAGGCCGG TAAGGTCACT GGTAAGACCC TCCTCGAGGC 550 

CATCGACGCC ATTGAGCCCC CTACCCGTCC CGCCAACAAG GTTAGTCCCT 600 

CCTCGACTAC TCAAACCCTC CTCATAAGTT CAGATTACTG ACTCGTTCAC 65 0 

3 5 AGCCCCTCCG TCTTCCCCTC CAGGATGTCT ACAAGATCGG TGGTATTGGA 7 00 

ACGGTTCCCG TCGGTCGTGT TGAG AC TGGT ACCATCGTTC CTGGTATGGT 7 50 

TGTCACCTTG TAAGTCACTC TCCTCGCTTA TCCTACCTGA AATCATCATG 800 

TGCTAACTTG ACACTCAGCG CTCCCGCCAA CGTCACCACT GAAGTCAAGA 85 0 

GTGTTGAAAT GCACCACCAG CAGCTCACTG CCGGTCAGCC CGGTGACAAC 900 

40 GTTGGTTTCA ACGTGAAGAA CGTCTCCGTC AAGGAAATCC GTCGTGGTAA 950 

CGTTGCTGGT GACAGCAAGA ACGACCCCCC TGCCGGTGCT GCCTCCTTCA 1000 

ACGCCCAGGT CATCGTCCTC AACCACCCCG GTCAGGTCGG TGCTGGTTAC 105 0 

GCCCCAGTCC TCGATTGCCA CACTGCCCAC ATTGC TTGC A AGTTC GCTGA 1100 

GCTCCTCGAG AAGATTGACC GTCGTAC CGG AAAGTCTGTT GAGGACCACC 1150 

45 CCAAGTTCAT C AAGTC CGGT GACGCTGCCA TCGTCAAGAT GATTCCTTCC 12 00 

AAGCCCATGT GTGTTGAGGC TTTCACCGAG TACCCTCCTC TCGGTCGTTT 12 50 

CGCCGTTCGC GAGTAAGTTT TATCTCCGTT GTCTATTTTC CATCCTTCCC 13 00 

TTCTCCTCCG TCTTCCATAT ATATTTTTTC AGTTATATGT GAC TAACC AC 13 50 

AAATCACGGG AATAGC 13 66 

50 

2 ) INFORMATION FOR SEQ ID NO: 904 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 



711 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Curvularla lunata 

(B) STRAIN: ATCC 26425 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 904 



CTATTATCGT 

10 CATCTGCTGC 
CAACAAGGTC 
TGGAGATGCG 
CCCATCATCA 
GATTGGTGTC 

15 TCCCCACCCC 
GATGTCTTCT 
GCGAGGTATC 
CCGCCCCCAT 
TGCGAGGAGT 

2 0 CAAGCGTGAT 
TCAAGGCGCA 
GAAGGTGGTC 
CATCCGCACT 
AGGACGCCCA 

25 TGCGAGCTCC 



TGTTGCCGCT 
TCGCCCGCCA 
GATGCTGTTG 
CGAATTGCTC 
TGGGATCTGC 
AACCGAATTG 
TCAGCGTGAG 
CCATTGCTGG 
CTGAAGCGCG 
CAAGACCAAG 
CTCGCGCTGG 
GAAGTCCGCC 
CAAGAAGTTC 
GTCACACTGG 
GCCGACGAGT 
TGACAAGCTT 
ATGCACCACA 



TCCGACGGTC 
GGTCGGTGTC 
AGGACAAGGA 
AGCAGCTACG 
CCTCTGCGCC 
ATGAGCTGCT 
AC C G AC AAGC 
TCGTGGCACT 
ATGCTGAAGT 
GTTACCGATA 
TGACAACTCC 
GTGGTATGGT 
CTTGTCTCCA 
CTTCGGTGAG 
CGTGTGCCCT 
GTTATGCCCG 
CGTCTTGGAG 



AAATGCCCCA 
CAGAAGATCG 
GATGTTGGAG 
GCTTCGAGGG 
ATTGAGGGCC 
CGAGGCCGTT 
CTTTCCTCAT 
GTCGTCTCTG 
TGAGCTCGTC 
TCGAGACCTT 
GGTCTCCTTC 
CGTTTCCGTC 
TGTACGTGTT 
AACTACAGGC 
GTACTGGCCA 
GTGACAACGT 
CCTGGTCAAC 



GACTCGTGAG 50 

TTGTCTTCGT 100 

CTCGTCGAGA 150 

CGACGAGACT 200 

GCGAACCTGA 250 

GATACTTGGA 300 

GGCCGTCGAG 350 

GCCGTGTCGA 400 

GGCAAGGGCA 45 0 

CAAGAAGTCC 500 

TTCGTGGTGT 55 0 

CCTGGACAGG 600 

GAGCAAGGAG 650 

CGCAAATGTT 7 00 

GAAGGCACCG 75 0 

CGAGATGGTT 800 

G 841 



30 



2 ) INFORMATION FOR SEQ ID NO: 905 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 967 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Aspergillus niger 

(B) STRAIN: ATCC 9508 

.(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 905 

45 CGGTGCTATC ATTGTCGTCG CCGCCTCCGA CGGTCAGATG TACGTTAACC 50 

TTAAAAGAAT AACTCTCCTT CAGTATATAT GCTTACACTG GCGATCAACA 100 

GGCCCCAGAC TCGTGAGCAC TTGCTGCTTG CTCGTCAGGT CGGTGTCCAG 150 

AAGATCGTTG TCTTCGTCAA CAAGGTCGAT GCTATCGATG ACCCCGAGAT 200 

GCTGGAGCTC GTTGAGCTGG AAATGCGCGA GCTTCTCAGC ACCTACGGAT 250 

50 TCGAGGGTGA GGAGACCCCC ATCGTCTTCG GCTCTGCTCT CTGCGCCATT 300 

GAGGACCGCC GCCCCGACAT CGGTACCGAG CGTATCGACG CTCTCCTCGA 3 50 

GGCCGTTGAC ACCTGGATCC CCACTCCCCA GCGTGACCTT GACAAGCCTT 400 

TCTTGATGTC CATTGAGGAA GTTTTCTCCA TCCCCGGTCG TGGTACCGTC 450 

GCCTCCGGCC GTGTCGAGCG TGGTCTCCTG AAGCGTGATA GCGAGGTTGA 500 

55 GATCATCGGT ACCACCAACG AGGTCATCAA GACCAAGGTT ACCGACATTG 550 

AGACCTTCAA GAAGTCCTGC TCCGAGTCCC GCGCCGGTGA CAACTCCGGT 600 

CTCCTGCTCC GTGGTGTCCG CCGTGAGGAT CTCCGCCGTG GTATGGTCAT 650 

TGCCGCTCCT GGCAGCGCCA AGGCCAACAG CAAGTTCATG GTCTCCATGT 700 

ACGTCCTGAC CGAGGCTGAR GGTGGTCGCC GTACCGGTTT CGGTGTCCAG 7 50 

60 TACCGTCCCC AGC TGTTC AT CCGCACTGCC GGTAAGTAAA ATTGCATTCT 800 

712 
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ATTCCGCTAC TAGGGAACCA TCTCTAATTC 
GCTGAGTTCA GCTTCCCCGA CGGAGACCAG 
TGACAACGTC GAGATGATCG TCAAGACCCA 
CCGGTCAGCG CTTCAAC 



TATTTGCTAC AGATGAGGCT 850 
TCCCGCCGTA TCATGCCCGG 900 
CCGCCCCGTC GCCGCCGAGG 95 0 

967 



2 ) INFORMATION FOR SEQ ID NO: 906 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bipolaris hawaliensls 
20 (B) STRAIN: ATCC 26067 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 906 

TGGTGCTATT ATTGTTGTTG CCGCTTCCGA CGGTCAAATG CCCCAGACTC 50 

25 GTGAGCATCT GCTGCTCGCC CGTCAGGTCG GTGTTCAGAA GATCGTTGTC 100 

TTCGTTAACA AGGTCGACGC TGTCGAGGAC AAGGAGATGT TGGAGCTTGT 150 

CGAGATGGAG ATGCGCGAAC TGCTCAGCAG CTATGGCTTC GAGGGCGACG 2 00 

AGACCCCTAT CATCATGGGT TCTGCTCTCT GCGCCATTGA AGGCCGTCAA 2 50 

CCCGACATTG GTGTCGAACG AATTGACGAG CTGCTCGAGG CTGTTGATAC 3 00 

3 0 -TTGG ATTCC C ACCCCTCAGC GTGAGACCGA AAAGCCTTTC CTCATGGCCG 350 

TCGAGGATGT CTTCTCCATT GCTGGTCGTG GTACCGTCGT CTCTGGCCGT 4 00 

GTCG AAC GAG GTATCCTGAA GCGCGATGCT GAAGTTGAGC TTGTGGGCAA 45 0 

GGGCAGCGCA CCCATCAAGA CCAAGGTTAC CGATATCGAG ACCTTCAAGA 5 00 

AGTCTTGCGA GGAGTCCCGC GCTGGTGACA ACTCCGGTCT CCTTCTTCGT 550 

3 5 GGTGTTAAGC GTGATGAAGT CCGCCGTGGT ATGGTCGTTT CCGTCCCTGG 600 

ACAGGTTAAG GCGCACAAGA AGTTCCTTGT CTCCATGTAT GTGCTGAGCA 65 0 

AGGAGGAAGG TGGCCGACAC ACTGGCTTCG GTGAGAACTA CAGGCCGCAA 7 00 

ATGTTCATCC GCACTGCCGA CGAGTCGTGT GCCCTGTACT GGCCAGAAGG 7 50 

CACCGAGGAT GCCCACGACA AGCTTGTCAT GCCCGGTGAC AACGTCGAGA 80 0 

40 TGGTTTGCGA GCTCCATGCA CCACACGTCT TGG AG AC TGG TCAGCGCTTC 85 0 

AA 852 



45 2 ) INFORMATION FOR SEQ ID NO: 907 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 966 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus flavus 

(B) STRAIN: ATCC 26947 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 907 

713 
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GGTGCTATTG TTGTCGTTGC TGCTTCGGAT GGTCATGATG TATGGACAGG 50 

CCCTTTGCTA CTGAATGGTT TCAAGATCTC GCGCTTACAC GTATTATAAT 100 

AGGCCCCAGA CCCGGGAGCA CTTGCTGCTT GCCCGTCAGG TCGGTGTCCA 150 

GAAGATCGTC GTTTTTGTCA ACAAGATTGA TGCCGTTGAG GACCCTGAGA 2 00 

5 TGTTGGAGCT TGTCGAGTTG GAAATGCGCG AGCTCCTTAG CAGCTACGGC 250 

TTCGAGGGCG AAGAGACTCC CATCATCTTC GGTTCTGCTC TGTGTGCTTT 3 00 

GGAGGACCGT CGCCCCGACA TTGGTGCCGA GCGTATCGAC GAGCTCATGA 350 

AGGC CGTTGA CACCTGGATC CCTACCCCTC AGCGTGATCT TGACAAGCCT 400 

TTCCTCATGT CTGTCGAGGA AGTCTTCTCC ATCGCCGGTC GTGGTACCGT 450 

10 TGCCTCCGGC CGTGTCGAAC GTGGTATCCT GAAGAAGGAC AGCGAAGTCG 500 

AGATCATCGG AGGTAGCTTC GATGC T AC C A AGACCAAGGT CACCGACATT 550 

GAGACCTTCA AG AAGTC TTG TGACGAGTCC CGCGCTGGTG ACAACTCTGG 600 

CTTGCTTCTC CGTGGTATCC GTCGTGAAGA CGTCCGCCGC GGAATGATCA 650 

TTGCTGCTCC TGGCAGCACC AAGGCCCACG ACCAGTTCTT GGTGTCCATG 7 00 

15 TACGTTCTCA CTGAGGCTGA GGGTGGTCGT CGTACTGGCT TCGGCTCCAA 7 50 

CTACCGCCCC CAGGTGTTCG TTCGCACTGC TGGTAAGTCA AGCCTTTTGC 800 

TCACTTAACG GTATTGATTA AGTTCTAACT GTTGTATCCT AGATGAGGCT 850 

GCTGACCTCA GCTTCCCCGA CGGTGATGAG TCCCGGAGGG TGATGCCTGG 900 

TGACAACGTC G AG ATGGTC C TCAAGACTCA CCGCCCCATT GCTGCTGAGG 950 

20 CTGGCCAGCG CTTCAA 9 66 



2 ) INFORMATION FOR SEQ ID NO: 908 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Alternaria alternate 

(B) STRAIN: ATCC 62099 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 908 

40 GGTGCTATCA TCGTCGTTGC TGCTTCCGAT GGTCAGATGC CCCAGACCCG 50 

TGAGCACTTG CTGCTCGCCC GTCAGGTCGG TGTTCAGAAG ATCGTTGTCT 100 

TCGTCAACAA GGTCGATGCT GTCGAAGACC CGGAGATGTT GGAACTCGTC 150 

GAGATGGAGA TGCGTGAGTT ACTCACCAGC TACGGCTTCG AGGGCGACGA 2 00 

GACACCCATC ATCATGGGTT CCGCTCTATG CGCCATCGAG GGCCGCCAGC 2 50 

45 CCGAGATCGG TGTTACCAAG GTCGACGAGC TAATGGACGC TGTCGACTCA 3 00 

TGGATCCCCA CCCCTCAGCG TGAGACCGAG AAGCCTTTCC TCATGGCTGT 3 50 

TGAGGATGTC TTCTCGATTG CTGGACGTGG TAC CGTCGTT TCGGGCCGTG 400 

TCGAGCGCGG TATCTTGAAG CGTGACGCTG AAGTCGAGCT TGTCGGCAAG 450 

GGCACCGCGC CAATCAAGAC CAAGGTCACT GATATTGAGA CCTTCAAGAA 500 

50 GTCGTGCGAG GAGTCGCGCG CGGGTGATAA CTCCGGTCTT CTCCTCCGTG 550 

GTGTCAAGCG TGATGACGTT CGCCGCGGTA TGGTTGTTTC CGTTCCCGGA 600 

CAAGTCAAGG CTCACAAGAA GTTCCTTGTC TCCATGTACG TTCTAAGCAA 650 

AGAGGAGGGT GGTCGTCACA CCGGCTTCGG CGAGAACTAC AGGCCGCAAA 7 00 

TGTTCATCCG AACTGCCGAT GAATCCTGCG CACTTCACTT CCCAGAGGGT 7 50 

55 ACCGAGGATG CGCACGACAA GCTAGTTATG CCCGGTGACA ACGTCGAGAT 800 

GGTCTGCGAA CTCCACCAGC CCCACGTTCT AGAGACCGGT CAGCG 845 



60 2 ) INFORMATION FOR SEQ ID NO: 909 

714 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 931 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peniclllium marneffei 

(B) STRAIN: ATCC 64101 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 09 



CGCTGTTGTT GTCGTCGCTG CTTCTGATGG TCAAATGTAA CATATCCACG 50 

AGCTGCCAAT TATGGACACT GCTGATAAGA ATAGGCCCCA AACCCGTGAG 100 

CACTTGCTCC TCGCCCGTCA GGTCGGTGTT CAAAAGATCG TCGTCTTCGT 150 

CAACAAGGTT GATGCCGTCG AGGACCCCGA GATGTTGGAA CTTGTCGAAT 2 00 

2 0 TGGAAATGCG TGAACTCTTG ACCACCTACG GTTTCGAGGG TGAAGAGACC 250 

CCTATCATTT TCGGATCCGC TCTTTGCGCC TTGGAAGGCC GCAAGCCCGA 3 00 

GATTGGCGAA CAGAAGATTG ACGAGCTCAT GAACGCCGTT GAT AC C TGG A 3 50 

TCCCCACCCC CCAGCGTGAC CTTGACAAGC CCTTCTTGAT GTCCGTTGAG 400 

GAAGTTTTCT CCATCTCTGG TCGTGGTACC GTTGCATCTG GTCGTGTTGA 450 

2 5 GCGTGGTATT TTGCGCAAGG ATTCTGAGGT TGAGATTATC GGATACCAGA 500 

AGAACCCTAT CAAGACCAAG GTTACCGACA TTGAGACCTT CAAGAAGTCT 55 0 

TGCGATGAAT CTCGTGCTGG TGACAACTCT GGCTTGCTTC TCCGTGGTAT 600 

CAAGCGTGAG GACATTCGTC GTGGTATGGT TATCGCTGCT CCTGGAACCA 65 0 

CCAAGGC TC A TGACAACTTC TTGGTCTCCA TGTATGTCTT GACTGAGGCT 7 00 

3 0 GAAGGTGGTC GTCGTACTGG ATTCGGCGCC AACTACCGTC CTCAAGCTTT 750 

CATCCGTACT GCCGGTATGT TCCCTTTCAA AGTCAATTAA TGAGCGATTT 800 

GCTAACGAGT TATAGATGAG GCTGCTACTC TCAGCTTCCC CGGTGACGAT 85 0 

CAGTCCAAGC AGGTCATGCC CGGTGACAAC GTTGAGATGA TCTTGAAGAC 900 

ACACCGTCCC GTTGCCGCCG AAGCTGGTCA G 931 

35 



2 ) INFORMATION FOR SEQ ID NO: 910 

4 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 931 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 
50 (B) STRAIN: ATCC 58950 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 910 

CGCTGTTGTT GTCGTCGCTG CTTCTGATGG TCAAATGTAA CATATCCACG 50 

55 AGCTGCCAAT TATGGACACT GCTGATAAGA ATAGGCCCCA AACCCGTGAG 100 

CACTTGCTCC TCGCCCGTCA GGTCGGTGTT CAAAAGATCG TCGTCTTCGT 150 

CAACAAGGTT GATGCCGTCG AGGACCCCGA GATGTTGGAA CTTGTCGAAT 2 00 

TGGAAATGCG TGAACTCTTG ACCACCTACG GTTTCGAGGG TGAAGAGACC 25 0 

CCTATCATTT TCGGATCCGC TCTTTGCGCC TTGGAAGGCC GCAAGCCCGA 3 00 

60 GATTGGCGAA CAGAAGATTG ACGAGCTCAT GAACGCCGTT GATAC C TGG A 3 50 

715 
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TCCCCACCCC CCAGCGTGAC 
GAAGTTTTCT CCATCTCTGG 
GCGTGGTATT TTGCGCAAGG 
AGAACCCTAT CAAGACCAAG 
5 TGCGATGAAT CTCGTGCTGG 
CAAGCGTGAG GACATTCGTC 
CCAAGGCTCA TGACAACTTC 
GAAGGTGGTC GTCGTACTGG 
CATCCGTACT GCCGGTATGT 
10 GCTAACGAGT TATAGATGAG 
CAGTCCAAGC AGGTCATGCC 
ACACCGTCCC GTTGCCGCCG 



CTTGACAAGC CCTTCTTGAT 
TCGTGGTACC GTTGCATCTG 
ATTCTGAGGT TGAGATTATC 
GTTACCGACA TTGAGACCTT 
TGACAACTCT GGCTTGCTTC 
GTGGTATGGT TATCGCTGCT 
TTGGTCTCCA TGTATGTCTT 
ATTCGGCGCC AACTACCGTC 
TCCCTTTCAA AGTCAATTAA 
GCTGCTACTC TCAGCTTCCC 
CGGTGACAAC GTTGAGATGA 
AAGCTGGTCA G 



GTCCGTTGAG 


400 


GTCGTGTTGA 


450 


GGATACCAGA 


500 


CAAGAAGTCT 


550 


TCCGTGGTAT 


600 


CCTGGAACCA 


650 


GACTGAGGCT 


700 


CTCAAGCTTT 


750 


TGAGCGATTT 


800 


CGGTGACGAT 


850 


TCTTGAAGAC 


900 




931 



15 

2 ) INFORMATION FOR SEQ ID NO : 911 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



25 



30 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 911 
GACGGMKKCA TGCCGCARAC 20 

2) INFORMATION FOR SEQ ID NO: 912 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 912 
GACGGCGKCA TGCCGCARAC 20 

45 



2 ) INFORMATION FOR SEQ ID NO: 913 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 913 
60 GACGGYSYCA TGCCKCAGAC 

716 
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2 ) INFORMATION FOR SEQ ID NO: 914 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 914 

15 

GAARAGCTGC GGRCGRTAGT G 21 



2 0 2) INFORMATION FOR SEQ ID NO: 915 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

2 5 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 915 

AAACGCCTGA GGRCGGTAGT T 21 



35 



45 



50 



2 ) INFORMATION FOR SEQ ID NO : 916 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
40 (B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 916 
GCCGAGCTGG CCGGCTTCAG 20 

2 ) INFORMATION FOR SEQ ID NO: 917 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



60 (ii) MOLECULE TYPE: DNA 



717 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 917 
TCGTGCTACC CGTYGCCGCC AT 

5 

2 ) INFORMATION FOR SEQ ID NO: 918 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1391 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
20 (E) ACCESSION NUMBER: J01672 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 918 

AG AG AAGCC T GTCGGC AC CG TCTGGTTTGC TTTTGCCACT GCCCGCGGTG 50 

2 5 AAGGCATTAC CCGGCGGGAT GCTTCAGCGG CGACCGTGAT GCGGTGCGTC 100 

GTCAGGCTAC TGCGTATGCA TTGC AG AC C T TGTGGC AAC A ATTTCTACAA 150 

AACACTTGAT ACTGTATGAG CATACAGTAT AATTGCTTCA ACAGAACATA 200 

TTG ACT ATC C GGTATTACCC GGCATGACAG GAGTAAAAAT GGCTATCGAC 2 50 

GAAAACAAAC AGAAAGCGTT GGCGGCAGCA CTGGGCCAGA TTGAGAAACA 300 

3 0 ATTTGGTAAA GGCTCCATCA TGCGCCTGGG TGAAGACCGT TCCATGGATG 350 

TGGAAACCAT CTCTACCGGT TCGCTTTCAC TGGATATCGC GCTTGGGGCA 400 

GGTGGTCTGC CGATGGGCCG TATCGTCGAA ATCTACGGAC CGGAATCTTC 450 

CGGTAAAACC ACGCTGACGC TGCAGGTGAT CGCCGCAGCG CAGCGTGAAG 500 

GTAAAACCTG TGCGTTTATC GATGCTGAAC ACGCGCTGGA CCCAATCTAC 55 0 

3 5 GCACGTAAAC TGGGCGTCGA TATCGACAAC CTGCTGTGCT CCCAGCCGGA 600 

CACCGGCGAG CAGGCACTGG AAATCTGTGA CGCCCTGGCG CGTTCTGGCG 650 

CAGTAGACGT TATCGTCGTT GACTCCGTGG CGGCACTGAC GCCGAAAGCG 7 00 

GAAATCGAAG GCGAAATCGG CGACTCTCAC ATGGGCCTTG CGGCACGTAT 750 

G ATGAGC C AG GCGATGCGTA AGC TGGCGGG TAACCTGAAG CAGTCCAACA 800 

4 0 CGCTGCTGAT CTTCATCAAC CAGATCCGTA TGAAAATTGG TGTGATGTTC 850 

GGTAACCCGG AAACCACTAC CGGTGGTAAC GCGCTGAAAT TCTACGCCTC 900 

TGTTCGTCTC GACATCCGTC GTATCGGCGC GGTGAAAGAG GGCGAAAACG 95 0 

TGGTGGGTAG CGAAACCCGC GTGAAAGTGG TGAAGAACAA AATCGCTGCG 1000 

CCGTTTAAAC AGGCTGAATT CCAGATCCTC TACGGCGAAG GT ATC AAC TT 1050 

45 CTACGGCGAA CTGGTTGACC TGGGCGTAAA AGAGAAGCTG ATCGAGAAAG 110 0 

CAGGCGCGTG GTAC AG CTAC AAAGGTGAGA AGATCGGTCA GGGTAAAGCG 115 0 

AATGCGACTG CCTGGCTGAA AG AT AACCC G GAAACCGCGA AAGAGATCGA 1200 

GAAGAAAGTA CGTGAGTTGC TGCTGAGCAA CCCGAACTCA ACGCCGGATT 125 0 

TCTCTGTAGA TGATAGCGAA GGCGTAGCAG AAACTAACGA AGATTTTTAA 1300 

50 TCGTCTTGTT TGATACACAA GGGTCGCATC TGCGGCCCTT TTGCTTTTTT 1350 

AAGTTGTAAG GATATGCCAT GACAGAATCA ACATCCCGTC G 1391 



22 



55 2 ) INFORMATION FOR SEQ ID NO: 919 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 

718 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 919 

GGICCIGART CITMIGGIAA RAC 23 



10 



20 



25 



2) INFORMATION FOR SEQ ID NO: 920 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 920 
TCICCVATIT CICCITCIAI YTC 23 

2 ) INFORMATION FOR SEQ ID NO: 921 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 921 
TIYRTIGAYG CIGARCAIGC 2 0 

40 

2 ) INFORMATION FOR SEQ ID NO : 922 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 922 
55 TARAAYTTIA RIGCIYKICC ICC 23 



2 ) INFORMATION FOR SEQ ID NO : 923 

60 



719 



CA 02307010 2000-05-19 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 923 

.0 

GACGCIGCCA TCCTGATGAT C 



15 2 ) INFORMATION FOR SEQ ID NO: 924 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 924 

ACCTCAGTCG TCACGTTGGC G 



2 ) INFORMATION FOR SEQ ID NO : 925 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 
35 (B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Single 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 925 
AAGCAGATGG TTGTGTGCTG 

45 

2 ) INFORMATION FOR SEQ ID NO: 926 

(i) SEQUENCE CHARACTERISTICS: 
5 0 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 926 
CAGCTGCTCG TGGTGCATCT CGAT 

60 

720 
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10 



20 



2 ) INFORMATION FOR SEQ ID NO: 927 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 927 
15 ACGCGGAGAA GGTGCGCTT 19 



2 ) INFORMATION FOR SEQ ID NO: 928 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
2 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 928 

30 

GGTCGTTCTT "CGAGTCACCG CA" 22 



35 2 ) INFORMATION FOR SEQ ID NO : 929 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 448 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides fragilis 

(B) STRAIN: ATCC 25285 



50 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 929 



TTCAGCATGC CATTTCAAAA CAGGCCGAAG CCGATATCGT GATTATCGCT 50 

GCTTGTGGGG AGCGTGCAAA TGAAGTTGTG GAAATCTTTA CCGAATTTCC 100 

GGAATTGGTG GACCCGCACA CGGGACGTAA GCTGATGGAG CGTACCATTA 150 

TTATCGCAAA TACATCGAAC ATGCCGGTAG CAGCGCGTGA AGCTTCTGTG 200 

55 TATACGGCCA TGACGATTGC CGAATACTAT CGTGCCATGG GATTGAAAGT 250 

CCTGCTGATG GCAGACTCCA CTTCCCGTTG GGCGCAGGCA TTGCGTGAGA 300 

TGTCGAACCG TATGGAGGAG TTGCCCGGAC CGGATGCATT CCCGATGGAC 350 

CTGTCCTCAA TCATTTCTAA CTTCTATGGC CGTGCAGGCT ACGTGAAACT 400 

GAATAACGGC GAGAGCGGTT CTATTACCTT TATC GGT AC A GTATCACC 448 
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2 ) INFORMATION FOR SEQ ID NO: 930 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 438 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides distasonls 
15 (B) STRAIN: ATCC 8503 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 930 

GCTATCTCTA AACAAGCGGA AGCGGATATC GTGATTATCG CCGCCTGCGG 50 

20 TGAGCGTGCG AATGAGGTCG TAGAGGTATT TACGGAGTTC CCGGAATTGG 100 

TAG ACC C GC A TACGGGACGT AAATTGATGG AACGTACGAT CATTATCGCC 150 

AATACATCCA AC ATGCC GGT AGCCGCTCGT GAGGCATCCG TATATACGGC 200 

GATGACCATC GCCGAGTATT ATCGCAGCAT GGGTTTGAAG GTTC TGTTGA 250 

TGGCCGACTC TACTTCCCGC TGGGCACAGG CTTTGCGTGA GATGTCCAAC 300 

2 5 CGTTTGGAGG AGTTGCCGGG ACCGGATGCT TTC CCGATGG ACTTGTCCGC 350 

TATCGTGGCG AACTTCTACG CTCGTGCGGG ATTC GTTC AT TTGAATAACA 400 

ACGCTACAGG CTCCGTCACT TTCATCGGTA CGGTATCG 438 



30 



2) INFORMATION FOR SEQ TD NO: 931 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 453 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas asaccharolytica 

(B) STRAIN: ATCC 25260 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 931 

CCTCCAGCAC GCTATCTCTA AGCAGGCGGA GGCTGATATC GTCATTATGG 5 0 

CAGCCTGCGG TGAGCGTGCT AATGAGGTGG TGGAGATCTT TGCCGAGTTC 100 

CCTGAGCTCG AAGACCCACA CACGGGACGC AAGCTGATGG AGCGTACGAT 150 

50 CATCATCGCT AACACGAGTA ACATGCCAGT GGCTGCTCGT GAGGCTTCGG 200 

TCTACACCGC TATGAC CATC GCTGAGTACT AC CGC TCGAT GGGTCTCAAA 250 

GTACTCCTAA TGGCTGACTC GACCTCTCGC TGGGCACAGG CACTGCGTGA 300 

GATGTCTAAC CGTCTAGAGG AGCTGCCTGG AC C AG ATGC A TTCCCGATGG 350 

AC TTGTC GGC TATCGTGGCA AACTTCTACG CTCGTGCCGG CTTCGTCTAT 400 

55 CTCAACAACG GTGAGACAGG TTCTGTAACC TTCATCGGTA CGGTCTCTCC 45 0 
AGC 



453 



60 2) INFORMATION FOR SEQ ID NO: 932 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria, monocytogenes 

(B) STRAIN: ATCC 15313 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 932 



CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCACAAACTC 5 0 

GTGAACATAT CTTACTTTCA CGTCAAGTTG GTGTTC C ATA CATCGTTGTA 100 

TTCATGAACA AATGTGACAT GGTTGACGAT GAAGAATTAC TAGAATTAGT 15 0 

TGAAATGGAA ATTCGTGATC TATTAACTGA ATATGAATTC CCTGGCGATG 2 00 

20 ACATTCCTGT AATCAAAGGT TCAGCTCTTA AAGCACTTCA AGGTGAAGCT 250 

GACTGGGAAG C TAAAATTG A CGAGTTAATG GAAGCTGTAG ATTCTTACAT 300 

TCCAACTCCW GAACGTGATA CTGACAAACC ATTCATGATG CCAGTTGAGG 3 50 

ATGTATTCTC AATCACTGGT CGTGGAACAG TTGCAACTGG ACGTGTTGAA 400 

CGTGGACAAG TTAAAGTTGG TGACGAAGTA GAAGTTATCG GTATCGAAGA 450 

25 AGAAAGCAAA AAAGTAGTAG TAACTGGAGT AGAAATGTTC CGTAAATTAC 500 

TAGACTACGC TGAAGCTGGC GACAACATTG GCGCACTTCT ACGTGGTGTT 55 0 

GCTCGTGAAG ATATCCAACR TGGTCAAGTA TTAGCTAAAC CAGGTTCGAT 600 

TACTCCACAC ACTAACTTCA AAGCTGAAAC TTATGTTTTA ACTAAAGAAG 65 0 

AAGGTGGACG TCACACTCCA TTCTTCAACA ACTACCGCCC ACAATTCTAT 7 00 

3 0 TTCCGTACTA CTGACGTAAC TGGTATTGTT ACACTTCCAG AAGGTACTGA 750 

AATGGTAAYG CCTGGTGATA ACATTGAGCT TGCAGTTGAA CTAATTGCAC 800 

CAATCGCTAT CGAAGAC GGT AC T AAATTC T CTATC 83 5 

35 

2) INFORMATION FOR SEQ ID NO: 933 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 933 
CATCATCGTI TTCMTGAACA ARTG 24 

50 

2 ) INFORMATION FOR SEQ ID NO: 93 4 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: DNA 

723 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 934 
TCACGYTTRR TACCACGCAG I AG A 24 

5 

2) INFORMATION FOR SEQ ID NO: 935 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 935 
20 GGIAARWSIC ARYTITGYCA YAC 23 



25 



35 



2) INFORMATION FOR SEQ ID NO: 93 6 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Single 
3 0 (D) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 6 
TCISIYTCIG GIARRCAIGG 20 



40 2 ) INFORMATION FOR SEQ ID NO: 937 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 937 

ATI AC IGARG Y I TTYGG I G A RTT 23 

55 2 ) INFORMATION FOR SEQ ID NO : 938 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 

724 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 8 

CYIGTIGYIS WIGCRTGIGC 2 0 



10 



20 



2 ) INFORMATION FOR SEQ ID NO: 939 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1203 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 
(C) ACCESSION NUMBER: D10023 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 939 

ATGTCTCAAG TTCAAGAACA ACATATATCA GAGTCACAGC TTCAGTACGG 50 

GAACGGTTCG TTGATGTCCA CTGTAC C AGC AG ACC TTTCA CAGTCAGTCG 100 

TTGATGGAAA CGGCAACGGT AGCAGCGAAG ATATTGAGGC CACCAACGGC 150 

3 0 TCCGGCGATG GTGGCGGATT GCAGGAGCAA GCGGAAGCGC AAGGTGAAAT 2 00 

GGAGGATGAA GCATACGATG AAGCTGCCTT AGGTTCGTTT GTGCCAATAG 250 

AAAAACTGCA AGTGAACGGG ATTACTATGG CGGATGTGAA AAAAC T AAGG 300 

GAGAGTGGGC TTCACACTGC TGAAGCGGTA GCATATGCTC CCAGAAAGGA 35 0 

TTTATTGGAA ATCAAAGGTA TATCGGAAGC TAAGGCAGAT AAGTTGCTAA 400 

3 5 ACGAAGCGGC AAGGCTAGTG C CTATGGG AT TTGTCACGGC TGCTGATTTT 45 0 

CATATGAGAA GATCGGAGCT GATTTGTTTG ACAACGGGTT CTAAGAATTT 500 

GGACACTCTT TTGGGTGGTG GTGTGGAAAC TGGTTCTATT AC TG AGCTTT 55 0 

TCGGTGAATT CAGGACAGGT AAGTCCCAGC TATGTCACAC TTTGGCCGTG 600 

ACATGCCAAA TTCCATTGGA TATTGGTGGC GGTGAAGGTA AGTGTTTGTA 65 0 

40 TATCGATACC GAAGGTACTT TCAGGCCGGT AAGATTGGTA TCCATAGCTC 7 00 

AGCGGTTCGG ATT AG ACC CG GATGATGCTT TGAACAACGT TGCGTATGCA 750 

AGAGCCTATA ACGCCGATCA TCAGTTAAGA CTTCTGGATG CTGCTGCCCA 800 

AATGATGAGC GAGTCTCGGT TTTCCTTGAT TGTGGTC GAT TCTGTTATGG 85 0 

CTCTATACCG TACGGATTTT TCTGGTCGTG GTGAACTAAG CGCAAGGCAA 900 

45 ATGCATTTAG CCAAATTTAT GCGTGCTTTG C AAAGGC TGG CCGACCAATT 950 

TGGTGTTGCA GTCGTCGTTA CTAACCAAGT GGTCGCCCAA GTTGATGGTG 1000 

GTATGGCTTT TAATCCAGAT CCAAAGAAGC CTATCGGTGG TAATATTATG 105 0 

GCACATTCTT CCACCACGCG ATTAGGTTTC AAAAAGGGTA AGGGATGTCA 1100 

AAGATTATGC AAAGTTGTTG ACTCACCTTG CTTACCAGAG GCTGAATGTG 115 0 

50 TGTTCGCGAT CTATGAAGAT GGTGTTGGTG ACCCCAGAGA AGAAGACGAG 12 00 

TAG 12 03 



55 2 ) INFORMATION FOR SEQ ID NO: 940 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1800 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sac char omyces cerevisiae 

(B) STRAIN: GRF88 

(C) ACCESSION NUMBER: M87549 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 940 



CGATCCAATT 
CTGAAGCTGA 
GAACAAGTTG 
15 CATCTTGTTT 
GAGGTAGGAT 
TCTAGGGATT 
CGCATGATAT 
CAAGTATATT 

2 0 GTTGTTTTTC 

AC AG AC CG AA 
TAGTTGACCC 
TTATATTTGC 
ATGTCTGTTA 
25 TAGTGTAGAT 
AATTGAAGTC 
TTTAAAACCT 
TAATTTGGAA 
TCTATGTAAA 

3 0 AAGCTGCTGG 

TTAGATATAA 
AGATTCAATC 
TTGGTGAATT 
ACCACGCAGC 

35 TATTGATACA 
AAGGTTATGA 
AGAGCCTTGA 
AGAACTTAGT 
CAAACTTCAG 

40 CAAAAGCTAA 
TAATGTTGCA 
CTGC ATTATT 
CTGGCACATG 
CGAAAGAGTT 

45 GTGTCTACGT 
TTGTATACTT 
TTTACTTTGT 



GCTGGTCTTA 
AGTCAAAGCT 
AATTAGCTGA 
GAAGACGTCT 
CCCTGAAGAT 
AATTAAATCG 
TCTTCCCTGG 
TTAACTGTAT 
TTCGGGTTTT 
AATAGCCGCC 
AAAAATTTGG 
TGTAGTCTGT 
CAGGAACTGA 
GAATTACAGA 
TGGTGGGATA 
CTGCTGTAGA 
AGGAAC TTTT 
ATTAAAGGGT 
AAAAATCATA 
GACAGCGTGT 
CTAGGTGGTG 
TAGGTGTGGT 
TTCCGAGGGA 
GAAGGCACTT 
ATTGGATCCC 
ATAGTGAACA 
TCTGGAGATT 
AGT AG AC TAC 
ATCAACATCT 
GTATTTCTGA 
TGCCTCGGCA 
CGTCAGCAAC 
GCCAAGTTAC 
AATTGG TGAA 
TTTTAATGAA 
GTT AC TAATA 



AGATGCATTT 
TACGACAAGT 
TGCTGCTCCT 
ACGTGAAAGG 
ACTTGGGACT 
TAAGGAAAAA 
AAGCGCCATT 
ATACAACAAT 
CCCACACATT 
CAAGGATAAA 
ATTCTACTTT 
TATGCCAATC 
GATCGATAGT 
ACTATGGTAT 
TACACAGTCA 
GGTTCTTTCC 
AT AG AC C GTT 
TAAGTGAGGT 
CAGGTGGGAT 
GTACTCCTTA 
GAATAATGAC 
AAGACACAGA 
AATGGGTGGT 
TCAGGCCCGA 
GAGTCATGTT 
TCAAATGGAA 
ATCGCCTTAT 
TGCGGTAGAG 
TTTCAAATTG 
CAAACCAAGT 
GATGGTAGGA 
AAGGATTTTG 
AAGATTCCCC 
AAAGGTATTA 
GATGACATTG 
TTATTAATAT 



GATTGATCTA 
CCGCTAGAAA 
CCTCCAGAAG 
TACAGAAACT 
TCAAAAAGCA 
TAAAATAATA 
TTATAGCAAG 
ATGACTCTTT 
GTGTGGAGAG 
CTTTTATATA 
CCAGATTTAC 
AGGAAAGCAT 
GATACAGCAA 
TAATGCCTCA 
ATGTATGTTA 
CCCTTTCTTT 
TTGTCAACAA 
GAAAGTGGAA 
TTATCCCCGC 
TCAACTGGAT 
AATGAGTATC 
TGTCTCATAC 
GGTGAAGGGA 
GAGGATTAAG 
TGGCAAACGT 
CTTGTTGAAC 
CGTGGTAGAT 
GTGAACTAAG 
AATAGATTGG 
TCAATCAGAC 
AACC AATTGG 
TTGAGAAAAG 
AG AT ATGC CT 
CCGATTCAAG 
CTCCTTTATT 
CTTGTTATGA 



GGTATTGCCA 


50 


ATACGTTGAC 


100 


CCAAATTATC 


150 


CCAACCCTAA 


200 


AGGTTTTGCC 


250 


GTGC TGTG AT 


300 


AAATGTAAGT 


350 


TTTATGCCTT 


400 


ATAGTTATTA 


450 


AAGGGAAGGG 


500 


TTTCACCCTT 


550 


TTGAACAAAT 


600 


AAAATATTCT 


650 


GATCTTCAAA 


700 


TAATAACATT 


750 


TACTAACTAA 


800 


CAAGAAGACA 


850 


AAAATTAAAG 


900 


TACTGTACAA 


950 


CTAAGCAACT 


1000 


ACTGAAGTAT 


1050 


TTTGTGTGTT 


1100 


AAGTAGCATA 


1150 


CAAATTGCAG 


1200 


TTCATATGCT 


1250 


AATTGGGTGA 


1300 


TCTATAATGG 


1350 


CGAAAGACAG 


1400 


CAGAGGAATT 


1450 


CCAGGTGCTT 


1500 


AGGGCACGTT 


1550 


GGCGTGGTGA 


1600 


GAAAAAGAAT 


1650 


TGACTAGTTT 


1700 


AAACTTTTCT 


1750 


TTCTTTGTTT 


1800 



50 
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2 ) INFORMATION FOR SEQ ID NO: 941 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 43 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus humicolus 

(B) STRAIN: ATCC 38294 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 941 

CGTCCTTATC CAGGAGCTCA TTAACAACAT TGCCAAGGCC CACGGTGGTT 50 

TCTC CGTCTT CACCGGTGTC GGTGAGCGTA CCCGTGAGGG TAACGACCTG 10 0 

TACCACGAGA TGCGTGAGAC TGGTGTCATC AACCTCGAGG GCGACTCCAA 150 

2 0 GGTCGCTCTC GTCTTCGGCC AGATGAACGA GCCCCCCGGA GCCCGTGCCC 200 

GTGTCGCCCT TACCGGCCTC ACCATCGCCG AGTACTTCCG TGACGAGGAG 250 

GGTCAGGACG TGCTTCTCTT CATCGACAAC ATTTTCCGTT TCACCCAGGC 3 00 

CGGTTCCGAG GTGTCTGCCC TTCTCGGTCG TATCCCCTCG GCCGTCGGTT 350 

ACCAGCCCAC CCTCGCTACC GACATGGGTT CCATGCAGGA GCGTATCACC 400 

25 ACCACCAAGA AGGGTTCGAT TACCTCCGTC 430 



30 



2 ) INFORMATION FOR SEQ ID NO: 942 



(i) SEQUENCE— CHARACTERISTICS : 

(A) LENGTH : 794 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
3 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 43895 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 942 

45 CGTGTAC GAT GCTCTTGAGG TGCAAAATGG TAATGAGCGT CTGGTGCTGG 5 0 

AAGTTCAGCA GCAGCTCGGC GGCGGTATCG TGCGTACCAT CGCAATGGGT 100 

TCCTCCGACG GTCTGCGTCG CGGTCTGGAT GTAAAAGACC TCGAACACCC 15 0 

GATCGAAGTC CCGGTAGGTA AAGCGACTCT GGGCCGTATC ATGAACGTAC 2 00 

TGGGTGAACC GGTCGACATG AAAGGCGAGA TCGGTGAAGA AGAGCGTTGG 2 50 

50 GCGATTCACC GCGCAGCACC TTCCTACGAA GAGCTGTCAA ACTCTCAGGA 3 00 

ACTGCTGGAA ACCGGTATCA AAGTTATC G A CCTGATGTGT CCGTTCGCTA 3 50 

AGGGCGGTAA AGTTGGTCTG TTCGGTGGTG CGGGTGTAGG TAAAACCGTA 400 

AACATGATGG AGCTCATTCG TAACATCGCG ATCGAGCACT CCGGTTACTC 450 

TGTGTTTGCG GGCGTAGGTG AACGTACTCG TGAGGGGAAC GACTTCTACC 500 

55 ACGAAATGAC CGACTCCAAC GTTATCGATA AAGTATCCCT GGTGTATGGC 550 

CAGATGAACG AGCCGCCGGG AAACCGTCTG CGCGTAGCTC TGACCGGTCT 600 

GACCATGGCT GAGAAATTCC GTGACGAAGG TCGTGACGTT CTGCTGTTCG 65 0 

TTGACAACAT CTATCGTTAC ACCCTGGCCG GTACGGAAGT ATCCGCACTG 7 00 

CTGGGCCGTA TGCCTTCAGC GGTAGGTTAT CAGCCGACCC TGGCGGAAGA 7 50 

60 GATGGGCGTT CTGCAGGAAC GTATCACCTC CACCAAAACC GGTT 7 94 
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2 ) INFORMATION FOR SEQ ID NO : 94 3 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNE S S : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 35401 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 943 

20 ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTGC AAAA TGGTAATGAG 50 

CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA TCGTACGTAC 100 

CATCGCAATG GGTTCCTCCG ACGGTCTGCG TCGCGGTCTG GATGTAAAAG 150 

ACCTCGAACA CCCGATTGAA GTCCCGGTAG GTAAAGCGAC TCTGGGCCGT 2 00 

ATCATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG AGATCGGTGA 2 50 

25 AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC GAAGAGCTGT 300 

CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT CGACCTGATG 350 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 400 

AGGTAAAACC GTAAACATGA TGGAGCTCAT TCGTAACATC GCGATCGAGC 450 

ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT 500 

30 AACGACTTCT AC C ACGAAAT GACCGACTCC AACGTTATCG ACAAAGTATC 550 

"CCTGGTGTAT "GGCCAGATGA ~ACGAGCCGCC~~GGGAAACCGT CTGCGCGTTG 600 

CTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTTCTGCTGT TCGTTGACAA CATCTATCGT TACACCCTGG C CGGTAC GGA 700 

AGTATCCGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 7 50 

35 CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AAC GTATC AC CTCCACCAAA 800 

ACTGGTTCTA TCAC 814 



40 2) INFORMATION FOR SEQ ID NO: 944 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 11775 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 944 



GTGTACGATG CTCTTGAGGT GCAAAATGGT AATGAGCGTC TGGTGCTGGA 50 

AGTTCAGCAG CAGCTCGGCG GCGGTATCGT GCGT AC CATC GCAATGGGTT 100 

CCTCCGACGG TCTGCGTCGC GGTCTGGATG TAAAAGACCT CGAACACCCG 15 0 

ATCG AAGTC C CGGTAGGTAA AGCGACTCTG GGCCGTATCA TGAACGTACT 200 

60 GGGTGAACCG GTCGACATGA AAGGC GAG AT CGGTGAAGAA GAGCGTTGGG 25 0 
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CGATTCACCG CGCAGCACCT TCCTACGAAG 
CTGCTGGAAA CCGGTATCAA AGTTATCGAC 
GGGCGGTAAA GTTGGTCTGT TCGGTGGTGC 
ACATGATGGA GCTTATTCGT AACATCGCGA 
5 GTGTTTGCGG GCGTAGGTGA ACGTACTCGT 
CGAAATG AC C GACTCCAACG TTATCGACAA 
AGATGAAC G A GCCGCCGGGA AACCGTCTGC 
ACCATGGCTG AGAAATTCCG TGACGAAGGT 
TGACAACATC TATCGTTACA CCCTGGCCGG 
10 TGGGCCGTAT GCCTTCAGCG GTAGGTTATC 
ATGGGCGTTC TGCAGGAACG TATCACCTCC 



AGCTGTCAAA CTCTCAGGAA 3 00 

CTGATGTGTC CGTTCGCTAA 3 50 

GGGTGTAGGT AAAACCGTAA 4 00 

TCGAGCACTC CGGTTACTCT 4 50 

GAGGGTAACG ACTTCTACCA 500 

AGTATCCCTG GTGTATGGCC 550 

GCGTTGCTCT GACCGGTCTG 600 

CGTGACGTTC TGCTGTTCGT 650 

TACGGAAGTA TCCGCACTGC 7 00 

AGCCGACCCT GGCGGAAGAG 750 

ACCAAAACCG GTTCTATC 7 98 



15 2 ) INFORMATION FOR SEQ ID NO: 945 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 25922 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 945 



"ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTGCAAAA TGGTAATGAG 50 

CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA TCGTGCGTAC 100 

CATCGCAATG GGTTCCTCCG ACGGTCTGCG TCGCGGTCTG GATGTAAAAG 150 

ACCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC TCTGGGCCGT 2 00 

35 ATCATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG AGATCGGTGA 2 50 

AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC GAAGAGCTGT 3 00 

CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT CGACCTGATG 350 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 400 

AGGTAAAACC GTAAACATGA TGGAGCTTAT TCGTAACATC GCGATCGAGC 450 

40 ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT 500 

AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG ACAAAGTATC 550 

CTTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 60 0 

CTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTC GTG AC 65 0 

GTATTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG CCGGTACCGA 700 

45 AGTATCCGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC CTCCACCAAA 80 0 

ACCGGTTCTA TC 812 



50 

2 } INFORMATION FOR SEQ ID NO: 946 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 832 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

60 
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CA 02307010 2000-05-19 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria polysaccharea 

(B) STRAIN: ATCC 43768 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 946 



GCGACGCTAT 
CTGACTCTGG 
TGCAATGGGT 

10 CTGGTGCGCC 
GTCGACGTAT 
CAAGAGCCGT 
CAACTACCGA 
CCGTTTGCTA 

15 CAAAACCGTG 

GCGGTCTGTC 

GACTTCTACC 
GGTTTACGGT 

TGACCGGTTT 
20 GGTAAAGGTC 
TCTGGCCGGT 
TGGGTTACCA 
ATTACCTCTA 



CCCGCATGTT 
AAGTTCAACA 
AGTTCAGACG 
AATCACTGTG 
TGGGTACGCC 
GCCATTCACC 
ATTGTTGGAA 
AAGGC GGTAA 
AACATGATGG 
C GTGTTCGC A 
ACGAGATGAA 
CAGATGAACG 
GACCATGGCG 
GCGACGTATT 
ACCGAAGTAT 
ACCGACATTG 
CCCAAACCGG 



TACGATGCCC 
AC TTC TGGGT 
GCCTGAAACG 
CCGGTAGGTA 
TGTTGATGAA 
AAACTGCTCC 
ACCGGTATTA 
AGTAGGTCTG 
AATTGATCAA 
GGTGTGGGCG 
AGATTCCAAC 
AACCTCCGGG 
GAATAC TTC C 
GTTCTTCGTT 
CTGCACTGTT 
GCTGAAGAAA 
TTC C ATTACT 



TGAAATTGGA 
GACGGCGTTG 
CGGCATGTCT 
AAGGTAC TTT 
GCAGGTCCGA 
GAAATTC GAC 
AAGTGATCGA 
TTCGGTGGTG 
CAACATCGCC 
AGCGTACCCG 
GTATTGGATA 
CAACCGTTTG 
GTGACGAAAA 
GACAACATCT 
GGGCCGTATG 
TGGGTCGTTT 
TC 



CGAGAACGGT 50 

TCCGTACTAT 100 

GTAAGC AATA 150 

GGGTCGTATT 200 

TCGATACCGA 250 

GAGTTGTCTG 300 

CTTGCTGTGT 350 

CCGGTGTAGG 400 

AAAGCGCACA 450 

TGAAGGTAAC 500 

AAGTGGCAAT 550 

CGCGTCGCAT 600 

AGACGAAAAC 650 

ACCGTTACAC 7 00 

CCTTCTGCAG 750 

GCAAGAGCGT 800 
832 



25 



30 



35 



40 



45 



50 



55 



60 



2 ) INFORMATION FOR SEQ ID NO: 947 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 84J) bases 
~JB) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria sicca 

(B) STRAIN: ATCC 9913 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 947 



TCCGCGCGAT 
ACGGCCTGAC 
ACTATTGCAA 
CAATACAGAT 
GTATTGTCGA 
ACCGACAAAC 
GTCTGCTACT 
TGTGTCCGTT 
GTAGGCAAAA 
GCATAGTGGT 
GTAACGACTT 
GCGATGGTTT 
AGCCTTGACC 
AAAGCGGCAA 
TACACTCTGG 
AGCAGTAGGT 
AGCGTATTAC 



GCCATTCCGC 
TTTGG AAG T A 
TGGGTAGTTC 
GCGCCGATTA 
TGTGTTGGGT 
ACCGTGCTAT 
ACCGAGCTGC 
TGCCAAAGGC 
CCGTCAACAT 
TTGTCCGTGT 
CTACCACGAG 
ACGGTCAGAT 
GGTTTG AC G A 
AGGTCGCGAC 
CCGGTACAGA 
TACCAACCGA 
CCTCTACTCA 



ATGTTTACGA 
CAACAGCTTC 
GGACGGTCTG 
CTGTGCCGGT 
ACACCTGTTG 
CCATCAGACA 
TGGAAACAGG 
GGTAAAGTAG 
GATGGAATTG 
TCGCCGGTGT 
ATGAAAGATT 
GAACGAACCT 
TGGCCGAATA 
GTATTGTTCT 
AGTATCCGCA 
CATTGGCTGA 
AACAGGCTCC 



CGCCCTGAAA 
TGGGCGACGG 
AAACGCGGCA 
AGGTAAAGGT 
ATGAAGCAGG 
GCTCCGAAAT 
CATTAAAGTG 
GTCTGTTCGG 
ATTAACAACA 
GGGGGAACGT 
CCAACGTATT 
CCGGGTAACC 
CTTCCGTGAT 
TCGTGGACAA 
TTGCTCGGTC 
AGAAATGGGT 
ATTACTTCTA 



TTGGATGCAA 50 

CGTGGTTCGT 100 

TGACTGTAAG 150 

ACTTTGGGAC 200 

TCCGATTGAT 25 0 

TCGATGAGTT 300 

ATTGACTTGC 350 

TGGTGCCGGT 400 

TCGCCAAAGC 450 

ACCCGTGAAG 500 

GGACAAAGTG 550 

GTCTGCGTGT 600 

G AAAAAG AC G 650 
CATTTACCGT . 7 00 

GTATGCCTTC 750 

CGTCTGCAAG 800 
840 
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2 ) INFORMATION FOR SEQ ID NO: 948 



10 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 903 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 948 



GCAGCTGGCG 

2 0 AAATGACGAA 

GTGATGGAGT 
CGTGGCATGG 
CAAAGAAACA 
TGGATGCTCC 
25 GCTCCAACCT 
TATCAAGGTT 
GACTCTTCGG 
ATCCACAACA 
TGGGGAACGT 

3 0 CTGGTGTTAT 

CCAGGAGCGC 
CTTCCGTGAT 
TCCGTTTCAC 
CCGTCAGCCG 
3 5 GCAAGAGCGT 



ACAAGCTACC 
AAAAAATCAA 
GGTTCGGACC 
AAGTGC TAGA 
CTTGGTCGCG 
TTTTG CGGAT 
TTGATGAGTT 
ATCGACCTGT 
TGGTGCCGGA 
TTGCCCAAGA 
ACCCGTGAAG 
C G AG AAAAC A 
GTATGCGGGT 
GTGGAAGGTC 
GCAGGCAGGT 
TTGGTTAC C A 
ATCACATCGA 



TGAGATCAAT 
AAATCGTCCT 
ATCGCTATGG 
TACTGGCCGT 
TCTTTAACGT 
GATGCAGAGC 
GTCTACTTCT 
TAGCCCCTTA 
GTTGGTAAGA 
ACACGGTGGT 
GGAATGACCT 
GCCATGGTCT 
TGCTTTGACT 
AAGATGTCTT 
TCTGAAGTTT 
ACCAACACTT 
CTAAGAAGGG 



ID NO: 948 






AATGCACTTG 


TAGTCTATAA 


50 


TGAAGTAGCT 


CTTGAGCTTG 


100 


AATCAACGGA 


TGGGTTGACT 


150 


CCAATTTCTG 


TGCCAGTCGG 


200 


TTTGGGAGAT 


ACCATTGACT 


250 


GCCAGCCAAT 


CCATAAGAAA 


300 


TCAGAGATCT 


TAGAGACAGG 


350 


TCTGAAAGGT 


GGTAAAGTTG 


400 


CCGTCCTGAT 


TCAAGAATTG 


450 


ATTTCTGTAT 


TTACTGGCGT 


500 


TTATTGGGAA 


ATGAAAGAGT 


550 


TCGGTCAGAT 


GAATGAGCCG 


600 


GGTTTGACGA 


TTGCAGAATA 


650 


GCTCTTCATT 


GACAACATCT 


700 


CTGCCCTTTT 


GGGTCGGATG 


750 


GCGACAGAAA 


TGGGGCAATT 


800 


TTC TGTAAC C 


TCT 


843 
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2 ) INFORMATION FOR SEQ ID NO: 949 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 0 (A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 49456 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 949 

55 GCAGCAGGGG AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA 5 0 

AAATGACGAA AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTGG 100 

GAGATGGTAT GGTCCGTACT ATCGCCATGG AATCAACAGA TGGTTTGACT 150 

CGTGGAATGG AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG 2 00 

TAAAGAAACT TTGGGACGTG TCTTCAATGT TTTGGGAGAT ACCATTGACT 25 0 

60 TGGAAGCTCC TTTTACAGAA GATGCAGAGC GTCAGCCAAT TCATAAAAAA 300 

731 
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GCTCCAACTT TTGATGAATT GTCTACCTCT 
GATTAAGGTT ATCGACCTTC TTGCCCCTTA 
GACTTTTCGG TGGTGCCGGA GTTGGTAAAA 
ATTCACAACA TTGCCCAAGA ACACGGTGGT 
5 TGGGGAACGT ACTCGTGAGG GTAATGACCT 
CAGGTGTTAT CGAGAAAACA GCCATGGTAT 
CCAGGAGCAC GTATGCGTGT TGCCCTAACT 
CTTCCGTGAT GTGGAAGGCC AAGACGTGCT 
TCCGTTTCAC TCAGGCTGGT TCAGAAGTAT 
10 CCATCAGCCG TTGGTTACCA ACCAACACTT 
GCAAGAGCGT ATTACATCAA CTAAAAAGGG 



TCTGAAATCC TTGAAACAGG 350 

CCTTAAAGGT GGTAAGGTTG 400 

CCGTCTTAAT CCAAGAATTG 450 

ATTTCAGTAT TTACCGGTGT 500 

TTACTGGGAA ATGAAAGAAT 550 

TTGGTCAGAT GAATGAGCCG 600 

GGTTTGACAA TCGCCGAATA 650 

TCTCTTTATC G AT AATATC T 700 

CTGCCCTTTT GGGTCGTATG 750 

GCTACGGAAA TGGGTCAATT 800 

TTCTGTAACC T 841 



15 2 ) INFORMATION FOR SEQ ID NO: 950 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

2 0 ( C ) STRANDEDNESS : Doubl e 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitls 

(B) STRAIN: LSPQ 2583 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 950 



"GCTACCTGAG ATCAATAATG C AC TTGTAGT CTATAAAAAT G AC G AAAAT A 50 

AATCAAAAAT CGTCCTTGAA GTAGC TCTTG AGCTTGGTGA TGGAGTGGTT 100 

CGGACCATCG CTATGGAATC AACGGATGGG TTGACTCGTG GCATGGAAGT 150 

GCTAGATACT GGTCGTCCAA TTTCTGTGCC AGTCGGCAAA GAAACACTTG 200 

35 GTCGCGTCTT TAACGTTTTG GGAGATACCA TTGACTTGGA TGCTCCTTTT 250 

GCGGATGATG CAGAGCGCCA GCCAATCCAT AAGAAAGCTC CAACCTTTGA 300 

TGAGTTGTCT AC TTC ATC AG AGATCTTAGA GACAGGTATC AAGGTTATCG 35 0 

ACCTGTTAGC ACCTTATCTG AAAGGTGGTA AAGTCGGACT CTTCGGTGGT 400 

GCCGGAGTTG GTAAGACCGT CC TGATTC AG GAATTGATCC ACAACATTGC 450 

40 CCAAGAGCAT GGTGGTATTT CCGTGTTTAC CGGTGTTGGG G AAC GT AC CC 500 

GTGAAGGGAA TGACCTTTAC TGGGAAATGA AGGAGTCTGG CGTTATCGAG 550 

AAAACAGCCA TGGTCTTCGG TCAGATGAAT GAGCC AC C AG GAGCGCGTAT 600 

GCGGGTTGCT TTGACTGGTT TGACGATTGC AGAGTACTTC CGTGATGTAG 650 

AAGGTCAAGA TGTCTTGCTC TTCATTGACA ACATCTTCCG TTTCACGCAG 700 

45 GCAGGTTCTG AAGTCTCTGC CCTTTTGGGT CGGATGCCAT CAGCCGTTGG 7 50 

TTACCAACCA ACACTTGCGA CTGAAATGGG ACAACTCCAA GAGCGTATTA 800 

CATC G AC TAA GAAAGGTTCT GTAACTT 827 



50 



60 



2) INFORMATION FOR SEQ ID NO: 951 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 844 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

732 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 951 

GCAGCAGGGG AAACACTTCC TGAGATTAAT AATGCACTTG TCGTCTACAA 50 

AAATGACGAA AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTGG 100 

GTGATGGTAT GGTCCGTACG ATCGCCATGG AATCAACAGA TGGTTTGACT 150 

10 CGTGGAATGG AAGTTTTGGA CACAGGCCGT CCAATCTCTG TACCAGTAGG 2 00 

TAAAGAAACT TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACTATTGACT 2 50 

TGGATGCTCC TTTCGCTGAA GACGCTGAGC GTCAGCCAAT TCATAAGAAA 3 00 

GCTCCAACTT TTGATGAATT GTCTACCTCA TCTGAAATCT TGGAAACAGG 350 

GATTAAGGTT ATCGACCTTC TTGCCCCTTA CCTTAAAGGT GGGAAAGTTG 4 00 

15 GACTCTTCGG TGGTGCCGGA GTTGGTAAAA CTGTCTTGAT CCAAGAGTTG 45 0 

ATTCACAACA TTGCCCAAGA ACATGGTGGT ATTTCAGTAT TTACCGGTGT 5 00 

TGGAGAACGT ACCCGTGAGG GGAACGACCT TTACTGGGAA ATGAAAGAAT 550 

CAGGCGTTAT CGAGAAAACA GCCATGGTAT TTGGTCAGAT GAATGAGCCA 600 

C C TGGAGC AC GTATGCGTGT TGCTCTTACT GGTTTGACAA TCGCCGAATA 65 0 

2 0 CTTCCGTGAT GTAGAAGGCC AAGATGTGCT TCTCTTTATC GACAATATCT 7 00 

TC CGTTTC AC TCAAGCTGGT TCAGAAGTAT . CTGCCCTTTT GGGTCGGATG 7 50 

CCTTCAGCCG TTGGTTACCA ACCAACACTT GCTACGGAAA TGGGTCAATT 800 

GCAAGAACGT ATCACATCAA CTAAGAAGGG TTCTGTAACC TCTA 844 



25 



40 
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2 ) INFORMATION FOR SEQ ID NO: 952 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 83 0 bases 

(B) TYPE : Nucleic acid — 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

3 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 952 



GCAGCAGGGG AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA 5 0 

AAATGACGAA AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTAG 100 

45 GAGATGGTAT GGTTCGTACT ATCGCCATGG AATCAACAGA TGGGTTG AC T 150 

CGTGGAATGG AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG 200 

TAAAGAAACT TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACCATTGATT 2 50 

TGGAAGCTCC TTTTACAGAA GACGCAGAGC GTCAGCCAAT TCATAAAAAA 3 00 

GCTCCAACTT TTGATGAGTT GTCTACCTCT TCTGAAATCC TTGAAACAGG 3 50 

50 GATCAAGGTT ATTGACCTTC TTGCCCCTTA CCTTAAAGGT GGTAAAGTTG 400 

GACTTTTCGG TGGTGCCGGA GTTGGTAAAA CCGTCTTAAT CCAAGAATTG 450 

ATTCACAACA TTGCCCAAGA GCACGGTGGT ATTTCAGTAT TTACTGGTGT 5 00 

TGGGGAACGT ACTCGTGAGG GGAATGACCT TTACTGGGAA ATGAAAGAAT 550 

CAGGCGTTAT CGAGAAAACA GCCATGGTCT TTGGTCAGAT GAATGAGCCA 600 

5 5 CCAGGAGCAC GTATGCGTGT TGCCCTTACT GGTTTGACAA TCGCTGAATA 650 

CTTCCGTGAT GTGGAAGGCC AAGACGTGCT TCTCTTTATC GATAATATCT 7 00 

TCCGTTTCAC TCAGGCTGGT TCAGAAGTAT CTGCCCTTTT GGGTCGTATG 750 

CCATCAGCCG TTGGTTACCA ACCAACACTT GCTACGGAAA TGGGTCAATT 8 00 

GCAAGAACGT ATCACATCAA CCAAGAAGGG 83 0 



733 
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10 



2 ) INFORMATION FOR SEQ ID NO: 953 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
15 (B) STRAIN: StrR-11 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 953 

AAAAACTTCC TGAGATTAAC AATGC AC TTG TCGTCTACAA AAATGACGAA 50 

2 0 AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTAG GAGATGGTAT 100 

GGTTCGTACT ATCGCCATGG AATCAACAGA TGGGTTGACT CGTGGAATGG 150 

AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG TAAAGAAACT 2 00 

TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACCATTGATT TGGAAGCTCC 25 0 

TTTTACAGAA GACGCAGAGC GTCAGCCAAT TCATAAAAAA GCTCCAACTT 300 

25 TTGATGAGTT GTCTACCTCT TCTGAAATCC TTGAAACAGG GATCAAGGTT 350 

ATTGACCTTC TTGCCCCTTA CCTTAAAGGT GGTAAAGTTG GACTTTTCGG 400 

TGGTGCCGGA GTTGGTAAAA CCGTCTTAAT CCAAGAATTG ATTCACAACA 450 

TTGCCCAAGA GCACGGTGGT ATTTCAGTAT TTACTGGTGT TGGGGAACGT 500 

ACTCGTGAGG GGAATGACCT TTACTGGGAA ATGAAAGAAT CAGGCGTTAT 550 

3 0 CGAGAAAACA GCCATGGTCT TTGGTCAGAT GAATGAGCCA CCAGGAGCAC 600 

GTATGCGTGT TGCCCTTACT GGTTTGACAA TCGCTGAATA CTTCCGTGAT 650 

GTGGAAGGCC AAGACGTGCT TCTCTTTATC GATAATATCT TCCGTTTCAC 7 00 

TCAGGCTGGT TCAGAAGTAT CTGCCCTTTT GGGTCGTATG CCATCAGCCG 750 

TTGGTT AC CA ACCAACACTT GCTACGGAAA TGGGTCAATT GCAAGAACGT 800 

35 ATCACATCAA CTAAGAAGGG TTC 823 



2 ) INFORMATION FOR SEQ ID NO: 954 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 954 

55 GCAGCAGGGG AAAAACTTCC TGAGATTAAC AATGC AC TTG TCGTCTACAA 50 

AAATGACGAA AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTAG 100 

GAGATGGTAT GGTTCGTACT ATCGCCATGG AATCAACAGA TGGGTTGACT 150 

CGTGGAATGG AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG 200 

TAAAGAAACT TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACCATTGACT 250 

60 TGGAAGCTCC TTTTACAGAA GACGCAGAGC GTCAGCCAAT TCATAAAAAA 3 00 

734 
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GCTCCAACTT TTGATGAGTT GTCTACCTCT TCTGAAATCC TTGAAACAGG 350 

GATCAAGGTT ATTGACCTTC TTGCCCCTTA CCTTAAAGGT GGTAAAGTTG 400 

GACTTTTCGG TGGTGCCGGA GTTGGTAAAA CTGTCTTAAT CCAAGAATTG 45 0 

ATTCACAACA TTGCCCAAGA GCACGGTGGT ATTTCAGTAT TTGCTGGTGT 500 

5 TGGGGAACGT ACTCGTGAGG GGAATGACCT TTACTGGGAA ATGAAAGAAT 550 

CAGGCGTTAT CGAGAAAACA GCCATGGTCT TTGGTCAGAT GAATGAGCCA 600 

CCAGGAGCAC GTATGCGTGT TGCCCTTACT GGTTTGACAA TCGCTGAATA 65 0 

CTTCCGTGAT GTGGAAGGCC AAGACGTGCT TCTCTTTATC GATAATATCT 700 

TCCGTTTCAC TCAGGCTGGT TCAGAAGTAT CTGCCCTTTT GGGTCGTATG 7 50 

10 CCATCAGCCG TTGGTTACCA ACCAACACTT GCTACGGAAA TGGGTCAATT 800 

GCAAGAACGT ATCACATCAA CCAAGAAGGG TTCTGTAACC TCTA 84 4 



15 2 ) INFORMATION FOR SEQ ID NO: 955 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 955 



AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA AAATGACGAA 50 

AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTAG GAGATGGTAT 100 

GGTTCGTACT ATCGCCATGG AATCAACAGA TGGGTTGACT CGTGGAATGG 150 

AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG TAAAGAAACT 200 

35 TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACCATTGATT TGGAAGCTCC 250 

TTTTACAGAA GACGCAGAGC GTCAGCCAAT TCATAAAAAA GCTCCAACTT 3 00 

TTGATGAGTT GTCTACCTCT TCTGAAATCC TTGAAACAGG GATCAAGGTT 3 50 

ATTGACCTTC TTGCCCCTTA CCTTAAAGGT GGTAAAGTTG GACTTTTCGG 400 

TGGTGCCGGA GTTGGTAAAA CCGTCTTAAT CCAAGAATTG ATTCACAACA 450 

40 TTGCCCAAGA GCACGGTGGT ATTTCAGTAT TTACTGGTGT TGGGGAACGT 500 

ACTCGTGAGG GGAATGACCT TTACTGGGAA ATGAAAGAAT CAGGCGTTAT 55 0 

CGAGAAAACA GCCATGGTCT TTGGTCAGAT GAATGAGCCA CCAGGAGCAC 600 

GTATGCGTGT TGCCCTTACT GGTTTGACAA TCGCTGAATA CTTCCGTGAT 65 0 

GTGGAAGGCC AAGACGTGCT TCTCTTTATC GATAATATCT TCCGTTTCAC 700 

45 TCAGGCTGGT TCAGAAGTAT CTGCCCTTTT GGGTCGTATG CCATCAGCCG 750 

TTGGTTACCA ACCAACACTT GCTACGGAAA TGGGTCAATT GCAAGAACGT 800 

ATCACATCAA CCAAGAAGGG TTCTGTAACC TCTA 834 



50 

2 ) INFORMATION FOR SEQ ID NO : 956' 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 495 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

60 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia microti 

(B) STRAIN: Per sing- 1 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 956 

TTGTATATCA CAGGCACTCA GCAAATATTC CGATACTGAC GTAATTATAT 50 

ACGTGGGTTG TGGTGAACGT GGGAATGAAA TGGCTGAGAT TCTATGCGAA 100 

TTCCCTGAAC TATCTACTGT AGTTAATGAT GAAAAGGTGG CCATTATGGA 150 

10 ACGTACATGC TTAGTTGCCA ATACTTCTAA TATGCCAGTG GC C GC TAG AG 200 

AAGCTAGTAT AT AC AC TGGT ATTACAATTG CTGAATATTT CCGTGATATG 250 

GGTTACAACT GCACTCTTAT GGCCGATTCC ACTAGCCGAT GGGCAGAGGC 300 

TCTAAGGGAA ATTTCTGGTA GATTGGCTGA AATGCCTGCA GATTCTGGCT 3 50 

ATCCGGCCTA TTTATC GTC A AGGTTGTCAG CTTTTTATGA ACGTGCAGGT 400 

15 GGGATAACTG TCTAATTAAT TTAGGCTTGA TTAAGTGCTT AGGTTCACCA 45 0 

ACACGAACCG GATCTATTAC GGTTGTAGGA GCAGTTTCTC CACCA 495 



20 2 ) INFORMATION FOR SEQ ID NO: 957 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 69 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Entamoeba histolytica 

(B) STRAIN: HM1-1MSS 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 957 



AGTTATTTCA CAAGCATTAA GTAAATATAG TAATTCAGAT GTTATTATTT 50 

ATGTAGGATG TGGTGAACGA GGAAATGAAA TGGCAGAAGT TCTTCGAGAT 100 

TTTCCAGCTC TTTC TATTAA AG TAGG AGAT AAAGAAGAAT CTATTATGAC 150 

AAGAACAGCA CTTGTTGCTA ATACATCTAA TATGCCTGTT GCAGCACGTG 200 

40 AAGCATCAAT TTATACTGGA ATTACATTAT CAGAATATTA TAGAGATATG 2 50 

GGATATAATG TTGCTATGAT GGCAGATTCA ACATCAAGAT GGGCTGAAGC 3 00 

ACTTAGAGAA ATTTCAGGAC GTCTTGCAGA AATGCCAGCT GATTCTGGAT 3 50 

ATCCAGCATA TCTTGCAGCA CGTTTAGCAT CATTTTATGA ACGTGCAGGT 400 

ATGGTTGAAT GTTTAGGATC ACCAAAAAGA ATAGGGTCAG TTTCTATTGT 450 

45 AGGAGCTGTT TCACCACCT 469 



50 



2 ) INFORMATION FOR SEQ ID NO : 958 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 452 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Fusobac terium nucleatum subsp. polymorphum 

736 
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(B) STRAIN: ATCC 10953 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 958 

5 TACAACACCA ACTTGCTAAA TGGGCAGATG CAGAAGTAGT TGTTTATGTT 50 

GGTTGTGGGG AACGTGGAAA TGAAATGACC GATGTACTTA TGGAATTCCC 100 

AGAAATTATT GACCCTAAGA CAGGACAATC TTTAATGAAG AGAACAGTTC 150 

TTATAGCTAA TACTTCTAAT ATGCCAGTTG CTGCTCGTGA GGCTTCAATC 200 

TATAC TGGTA TAACTATTGC AGAATATTTT AGAGATATGG GATATTCAGT 2 50 

10 GGCACTTATG GCAGATTCAA CAAGTCGTTG GGCAGAAGCA CTTCGTGAAA 3 00 

TGTCAGGACG TTTGGAAGAA ATGCCAGGTG ATGAAGGATA TCCAGCATAT 350 

CTATCAAGTA GAATAGCAGA GTTTTATGAA AGAGCAGGGC TTGTTGAATG 400 

TCTAGGTAAT GGAGAAGAAG GAGCATTAAC TGTAATTGGA GCAGTATCTC 450 

CA 452 

15 

2) INFORMATION FOR SEQ ID NO: 959 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania aethioplca 
30 (B) STRAIN: ATCC 50119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 959 

TGTAATTAGT CAGGCCCTCT CCAAGTACTC CAACTCCGAC TGCGTCATCT 50 

3 5 ATGTCGGCTG CGGCGAGCGC GGTAATGAGA TGGCCGAGGT GCTCATGGAG 100 

TTCCCGACCC TGACGACCGT GATCAATGGT CGCGAGGAGT CGATCATGAA 15 0 

GCGCACCTGC CTCGTGGCGA ACACTTCGAA CATGCCAGTC GCAGCCCGTG 2 00 

AGGCCTCTAT TTACACCGGC ATCACCCTGG CCGAGTACTA CCGTGATATG 250 

GGCAAGCATA TCGCCATGAT GGCCGACTCG ACGTCTCGCT GGGCCGAGGC 3 00 

40 GCTTCGTGAG ATTTCGGGTC GTCTGGCGGA GATGCCGGCC GATGGTGGCT 350 

ACCCTGCCTA TCTCAGCGCT CGTCTCGCCT CCTTCTACGA GCGCGCCGGC 400 

CTCGTCACCT GCATCGGCGG GCCGAAGCGC CAGGGCTCCG TCACGATTGT 45 0 

CGGTGCCGTG TCTCCGCCG 469 



2 ) INFORMATION FOR SEQ ID NO : 960 



(i) SEQUENCE CHARACTERISTICS: 
5 0 (A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 30815 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ 

TGTAATTAGT CAGGCCCTCT CCAAGTACTC 
ATGTCGGCTG CGGCGAGCGC GGTAATGAGA 
5 TTCCCGACCC TGACGACCGT GATCGATGGT 
GCGCACCTGC CTCGTGGCGA ACACTTCGAA 
AGGCCTCTAT TTACACCGGC ATCACCCTGG 
GGCAAGCATA TCGCCATGAT GGCCGACTCG 
GCTTCGTGAG ATTTCGGGTC GTTTGGCGGA 
10 ACCCTGCCTA TCTCAGCGCT CGTCTCGCCT 
CTCGTCACCT GCATCGGCGG GCCGAAGCGC 
CGGTGCCGTG TCTCCGCCG 



ID NO: 960 

CAACTCCGAC TGCGTCATCT 50 

TGGCCGAGGT GCTCATGGAG 100 

CGCGAGGAGT CGATCATGAA 150 

CATGCCAGTC GCAGCCCGTG 2 00 

CCGAGTACTA CCGTGATATG 2 50 

ACGTCTCGCT GGGCCGAGGC 3 00 

GATGCCGGCC GATGGTGGCT 350 

CCTTCTACGA GCGCGCCGGC 400 

CAGGGCTCCG TCACGATTGT 450 

469 



15 



25 



2 ) INFORMATION FOR SEQ ID NO : 961 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 469 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania guyanensis 

(B) STRAIN : ATCC 50126 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 961 

TGTCATCAGT CAGGCCCTCT CCAAGTACTC CAACTCGGAC TGTGTCATCT 50 

ATGTCGGCTG CGGTGAACGC GGTAACGAGA TGGCCGAGGT GCTCATGGAG 100 

TTCCCGACCC TGACGAC TGT GATCGATGGT CGCGAAGAGT CCATCATGAA 150 

35 GCGCACCTGC CTCGTGGCGA ACACCTCGAA CATGCCCGTC GCAGCCCGTG 2 00 

AGGCCTCTAT TTAT AC CGGC ATCACCCTTG CTGAGTACTA CCGTGATATG 2 50 

GGCAAGCACA TTGCCATGAT GGCCGACTCG ACGTCTCGCT GGGCCGAGGC 300 

GCTGCGTGAG ATTTCGGGTC GATTGGCGGA GATGCCGGCT GATGGTGGCT 350 

ACCCTGCCTA CCTCAGCGCC CGCCTCGCCT CCTTCTACGA GCGCGCCGGT 400 

40 CTCGTCACCT GCATCGGCGG GCCGAAGCGC CAGGGCTCCG TCACGATCGT 450 

CGGTGCAGTG TCTCCACCG 4 69 



45 2) INFORMATION FOR SEQ ID NO: 962 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania donovani 

(B) STRAIN: ATCC 50212 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 962 
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TGTCATTAGT CAGGCCCTCT CCAAGTACTC 
ATGTCGGCTG CGGCGAGCGC GGTAATGAGA 
TTCCCGACCC TGACGACCGT GATCGATGGC 
GCGCACCTGC CTCGTGGCGA ACACCTCGAA 
5 AGGCCTCTAT TTACACCGGC ATCACCCTGG 
GGCAAGCATA TCGCCATGAT GGCTGACTCG 
GCTTCGTGAG ATTTCGGGTC GTCTGGCGGA 
ACCCCGCCTA CCTCAGCGCT CGTCTCGCCT 
CTCGTCACCT GCATCGGCGG GCCGAAGCGC 
10 CGGTGCCGTG TCTCCACCG 



CAACTCCGAT TGCGTCATCT 50 

TGGCCGAGGT GCTCATGGAG 100 

CGCGAGGAGT CGATCATGAA 150 

CATGCCAGTC GCAGCCCGTG 2 00 

CCGAGTACTA CCGTGATATG 2 50 

ACGTCTCGCT GGGCCGAGGC 3 00 

GATGCCGGCC GATGGTGGCT 350 

CCTTCTACGA GCGCGCCGGC 400 

CAGGGCTCCG TCACGATCGT 450 

469 



15 



2 ) INFORMATION FOR SEQ ID NO: 963 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
2 5 (A) ORGANISM: Lelshmanla hertigi 

(B) STRAIN: ATCC 50125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 963 

30 TGTGATCAGT CAGGCCCTTT CCAAGTACTC CAACTCGGAC TGCGTCATCT 50 

ACGTCGGCTG TGGTGAGCGC GGGAACGAGA TGGCCGAGGT GCTCATGGAT 100 

TTCCCGACTT TGAC GACTGT GATCGATGGT CGCGAGGAGT CCATCATGAA 150 

GCGCACCTGC CTCGTGGCGA ACACCTCCAA CATGCCAGTT GCAGCCCGTG 200 

AGGCTTCTAT CTATACCGGC ATCACACTGG CTGAGTACTA TCGTGATATG 2 50 

35 GGCAAGCACA TTGCCATGAT GGCCGACTCG ACGTCTCGCT GGGCCGAGGC 3 00 

GTTGCGTGAG ATTTCCGGTC GGCTGGCGGA GATGCCGGCC GATGGTGGTT 3 50 

ACCCCGCCTA CCTCAGTGCC CGTCTCGCCT CCTTCTACGA GCGCGCTGGC 400 

CTCGTGACCT GTATCGGCGG GCCGAAGCGC CAGGGCTCCG TCACAATTGT 450 

TGGTGCGGTG TCTCCACCG 4 69 

40 



2 ) INFORMATION FOR SEQ ID NO: 964 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmanla mexlcana 
55 (B) STRAIN: ATCC 50156 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 964 

GGTCATTAGT CAGGCCCTTT CCAAGTACTC CAACTCTGAC TGCGTCATCT 50 
60 ACGTCGGCTG CGGCGAGCGC GGTAATGAGA TGGCCGAGGT GCTCATGGAG 100 
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TTCCCGACCC 
GCGCACCTGC 
AGGC CTCTAT 
GGCAAGCATA 
5 GCTTCGTGAG 
ACCCCGCCTA 
CTCGTCACCT 
CGGTGCCGTG 



TG AC G AC C AT 
CTCGTGGCGA 
CTACACCGGC 
TCGCCATGAT 
ATTTCGGGTC 
CCTCAGCGCT 
GCATCGGCGG 
TCTCCGCCG 



GATCGATGGT 
ACACCTCGAA 
ATCACCCTCG 
GGCCGACTCG 
GTCTGGCGGA 
CGTCTCGCCT 
GCCGAAGCGC 



CGGGAGGAGT 
CATGCCCGTC 
CCGAGTACTA 
ACGTCTCGCT 
GATGCCGGCC 
CCTTCTACGA 
CAGGGCTCCG 



CGATCATGAA 150 

GCAGCCCGTG 2 00 

CCGTGATATG 2 5 0 

GGGCTGAGGC 3 00 

GATGGTGGCT 35 0 

GCGCGCCGGC 400 

TCACGATCGT 450 
469 



10 



15 



20 



25 



2 ) INFORMATION FOR SEQ ID NO: 965 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 50129 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 965 



TGTAATTAGT 
ATGTCGGCTG 

3 0 TTCCCGACCC 
GCGCACCTGC 
AGGCCTCTAT 
GGCAAGCATA 
GCTTCGTGAG 

3 5 ACCCTGCCTA 
CTCGTCACCT 
CGGTGCCGTG 



CAGGCCCTCT 
CGGCGAGCGC 
TGACGACCGT 
CTCGTGGCGA 
TTACACCGGC 
TCGCCATGAT 
ATTTCGGGTC 
TCTCAGCGCT 
GCATCGGCGG 
TCTCCGCCG 



CCAAGTACTC 
GGTAATGAGA 
GATCGATGGT 
-ACACTTCGAA 
ATCACCCTGG 
GGCCGACTCG 
GTTTGGCGGA 
CGTCTCGCCT 
GCCGAAGCGC 



CAACTCCGAC 
TGGCCGAGGT 
CGCGAGGAGT 
CATGCCAGTC 
CCGAGTACTA 
ACGTCTCGCT 
GATGCCGGCC 
CCTTCTACGA 
CAGGGCTCCG 



TGCGTCATCT 50 

GCTCATGGAG 100 

CGATCATGAA 150 

GCAGCCCGTG 200 

CCGTGATATG 250 

GGGCCGAGGC 3 00 

GATGGTGGCT 3 50 

GCGCGCCGGC 400 

TCACGATTGT 45 0 
469 



40 



50 



2 ) INFORMATION FOR SEQ ID NO: 966 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 449 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus anaeroblus 

(B) STRAIN: ATCC 273 37 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 966 

CACCAGTTCG CCAAGTGGGC AGATGCTCAG ATAGTTGTAT ACGTTGGTTG 50 

TGGAGAACGT GGTAACGAGA TGACAGACGT TCTAAATGAA TTCCCAGAAC 100 

TGATTGACCC TCATACAGGC GAATCTCTAA TGAAGAGAAC AGTTCTTATA 150 

60 GCTAATACGT CAAATATGCC AGTTGCAGCC AGAGAGGCAA GTATATATAC 200 
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AGGTATTACA ATAGCTGAAT ATTTTAGAGA 
T AATGGCC G A CTCTACATCA AGATGGGCAG 
GGTCGTCTAG AAGAAATGCC TGGTGATGAA 
TTCTAGAGCT GCAGAGTTCT ATGAAAGAGC 
5 GTAGCGATAA TAGAGAGGGA GCCCTTACAA 



CATGGGATAT TCAGTAGCGG 250 

AGGCCCTAAG AG AGATGTC A 300 

GGTTATCCAG CCTATCTAGG 350 

AGGTAAGGTA ATATGTAAGG 400 

TAATAGGTGC CGTGTCACC 449 



2 ) INFORMATION FOR SEQ ID NO: 967 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

<vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Bordetella pertussis 

(B) STRAIN: ATCC 9797 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 967 

25 CTGGTGGTGT CGGCCGCAGA CGGCCCGATG CCGCAGACGC GCGAGCACAT 50 

TTTGCTGTCG CGCCAGGTTG GCGTGCCGTA CATCATCGTG TTCCTGAACA 100 

AGGCGGACAT GGTTGATGAC GCGGAGCTGC TCGAGCTGGT GAAGATGGAA 150 

GTCCGCGAAC TGCTGAGCAA GTACGATTTC CCGGGCGATG ACACGCCGAT 200 

CGTGAAGGGT TCGGCCAAGC TGGCGCTGGA AGGCGACAAG GGCGAACTGG 250 

3 0 GCGAGCAGGC GATTCTGTCG CTGGCGCAAG CGC TGG AC AC GTACATTCCG 3 00 

ACGCCGGAGC GCGCGGTCGA CGGTGCGTTC CTGATGCCGG TGGAAGACGT 350 

GTTCTCGATC TCGGGCCGTG GCACGGTGGT GACTGGCCGT ATCGAGCGCG 400 

GCGTGGTGAA GGTTGGCGAG GAAATCGAAA TCGTGGGCAT CAAGCCGACG 45 0 

GTGAAGACGA CCTGCACGGG CGTGGAGATG TTCCGCAAGC TGCTGGACCA 500 

35 GGGCCAGGCG GGCGACAACG TGGGTATCTT GCTGCGCGGC ACCAAGCGTG 550 

AAGACGTCGA GCGTGGCCAG GTGCTGGCCA AGCCGGGTTC GATCAACCCG 600 

CACACGGACT TCACGGCCGA GGTGTACATT CTGTCCAAGG AAGAGGGTGG 650 

CCGTCACACG CCGTTCTTCA ACGGCTATCG TCCGCAGTTC TACTTCCGCA 700 

CGACGGACGT GACCGGCACG ATCGACCTGC CGGCGGACAA GGAAATGGTG 750 

40 CTGCCGGGCG ACAACGTGTC GATGACCGTC AAGCTGCTGG CCCCGATCGC 800 

CATGGAAGAA GGTCTGCGTT TCGCCA 82 6 



45 2 ) INFORMATION FOR SEQ ID NO : 968 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 
50 <C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA . 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bordetella pertussis 

(B) STRAIN: BD180 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 968 
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CGATCCTGGT GGTGTCGGCC GCAGACGGCC 
CACATTTTGC TGTCGCGCCA GGTTGGCGTG 
GAACAAGGCG GACATGGTTG ATGACGCGGA 
TGGAAGTCCG CGAACTGCTG AGCAAGTACG 
5 CCGATCGTGA AGGGTTCGGC CAAGCTGGCG 
ACTGGGCGAG CAGGCGATTC TGTCGCTGGC 
TTCCGACGCC GGAGCGCGCG GTCGACGGTG 
GACGTGTTCT CGATCTCGGG CCGTGGCACG 
GCGCGGCGTG GTGAAGGTTG GCGAGGAAAT 

10 CGACGGTGAA GACG AC CTGC ACGGGCGTGG 
GACCAGGGCC AGGCGGGCGA CAACGTGGGT 
GCGTGAAGAC GTCGAGCGTG GCCAGGTGCT 
ACCCGCACAC GGACTTCACG GCCGAGGTGT 
GGTGGCCGTC ACACGCCGTT CTTCAACGGC 

15 CCGCACGACG GACGTGACCG GCACGATCGA 
TGGTGCTGCC GGGCGACAAC GTGTCGATGA 
ATCGCCATGG AAGAAGG 



CGATGCCGCA GACGCGCGAG 50 

CCGTACATCA TCGTGTTCCT 100 

GCTGCTCGAG CTGGTGGAGA 150 

ATTTCCCGGG CGATGACACG 2 00 

CTGGAAGGCG ACAAGGGCGA 2 50 

GCAAGCGCTG GACACGTACA 300 

CGTTCCTGAT GCCGGTGGAA 350 

GTGGTGACTG GCCGTATCGA 400 

CGAAATCGTG GGCATCAAGC 450 

AGATGTTCCG CAAGCTGCTG 500 

ATCTTGCTGC GCGGCACCAA 550 

GGCCAAGCCG GGTTCGATCA 600 

ACATTCTGTC CAAGGAAGAG 650 

TATCGTCCGC AGTTCTACTT 700 

CCTGCCGGCG GACAAGGAAA 750 

CCGTCAAGCT GCTGGCCCCG 800 

817 



20 

2 ) INFORMATION FOR SEQ ID NO: 969 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 637 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus columbae 

(B) STRAIN: ATCC 51263 

3 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 969 

CCTATGCCAC AAACTCGTGA ACACATTCTT TTATCACGTA ACGTTGGTGT 50 

GCCATACATC GTTGTTTTCT TAAACAAAGT TGATATGGTT GACGACGAAG 100 

AATTATTAGA ATTAGTTGAA ATGGAAGTTC GTGACTTATT AACTGAATAT 150 

40 GACTTCCCAG GAGACGATGT TCCTGTAATC GCTGGTTCTG CATTAAAAGC 200 

TTTAGAAGGC GACCCTGCTT ACGAAGAAAA AATCTTAGAA TTAATGGCTG 250 

CAGTTGACGA ATACATCCCA ACTCCAGAAC GTGACAACGA CAAACCATTC 3 00 

ATGATGCCAG TTGAAGACGT GTTCTCAATT ACTGGTCGTG GTACTGTTGC 350 

TACAGGTCGT GTTGAACGTG GACAAGTTCG TGTTGGTGAC GAAGTTGAAA 400 

45 TCGTTGGTAT CGCTGACGAA ACTTCTAAAA CAACAGTTAC TGGTGTTGAA 450 

ATGTTCCGTA AATTATTAGA TTACGCTGAA GCTGGAGACA ACATCGGTGC 500 

ATTATTACGT GGTGTGGCTC GTGAAGACAT CCAACGTGGT CAAGTATTAG 550 

CTAAAC C AGG TTCAATCACT CCACATACAA AATTCACTGC TGAAGTGTAC 600 

GTTTTAACTA AAGAAGAAGG TGGACGTCAT ACTCCAT 637 

50 



2) INFORMATION FOR SEQ ID NO: 970 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 634 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
5 (B) STRAIN: ATCC 49997 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97 0 

TATGCCTCAA ACACGTGAAC ACATCTTGTT ATCACGTAAC GTTGGTGTAC 5 0 

10 CATACATCGT TGTTTTCTTA AACAAAATGG ATATGGTTGA TGACGAAGAA 100 

TTACTAGAAT TAGTTGAAAT GGAAGTTCGT GACTTATTGT CAGAATATGA 150 

CTTCCCAGGC GACGATGTTC CTGTAATCGC TGGTTCTGCT TTGAAAGCTC 200 

TTGAAGGCGA TGCTTCATAC GAAGAAAAAA TCATGGAATT AATGGCTGCA 250 

GTTGACGAAT ACGTTCCAAC TCCAGAACGT GAC AC TG AC A AACCATTCAT 300 

15 GATGCCAGTC GAAGACGTAT TCTCAATCAC TGGACGTGGT ACTGTTGCTA 35 0 

CAGGCCGTGT TGAACGTGGA CAAGTTCGCG TTGGTGACGA AGTTGAAATC 400 

GTTGGTATTG CTGAAGAAAC TGCTAAAACA ACTGTAACTG GTGTTGAAAT 450 

GTTCCGTAAA TTGTTAGACT ATGCTGAAGC AGGGGATAAC ATTGGTGCAT 500 

TGCTACGTGG GGTTGCTCGT GAAGACATCC AACGTGGACA AGTATTAGCT 550 

2 0 AAAGCTGGTA CAATCACACC TCATACAAAA TTTAAAGCTG AAGTTTACGT 600 

TTTAACAAAA GAAGAAGGTG GACGTCACAC TCCA 63 4 



25 2 ) INFORMATION FOR SEQ ID NO: 971 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 bases 

(B) TYPE: Nucleic acid 

3 0 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-55 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 971 



GGACCAATGC CACAAACTCG TGAGCACATC CTTCTTTCAC GTCAGGTTGG 50 

TGTTAAACAC CTTATCGTCT TCATGAACAA AGTTGACTTG GTTGACGACG 100 

AAG AATTGC T TGAATTGGTT GAAATGGAAA TCCGTGACCT ATTGTCAGAA 150 

TACGACTTCC CAGGTGACGA TCTTCCAGTT ATCCAAGGTT CAGCACTTAA 200 

45 AGCTCTTGAA GGTGACTCTA AAT AC G AAG A CATCGTTATG GAATTGATGA 250 

ACACAGTTGA TGAGTATATC CCAGAACCAG AAC GTGAC AC TGACAAACCA 3 00 

TTGCTTCTTC CAGTCGAGGA CGTATTCTCA ATCACTGGAC GTGGTACAGT 350 

TGCTTCAGGA CGTATCGACC GTGGTATCGT TAAAGTCAAC GACGAAATCG 400 

AAATCGTTGG TATCAAAGAA GAAACTCAAA AAGCAGTTGT TACTGGTGTT 450 

5 0 GAAATGTTCC GTAAACAACT TGACGAAGGT CTTGCTGGAG ATAACGTAGG 500 

TGTCCTTCTT CGTGGTGTTC AACGTGATGA AATCGAACGT GGACAAGTTA 550 

TCGCTAAACC AGGTTCAATC AACCCACACA CTAAATTCAA AGGTGAAGTC 600 

TACATCCTTA CTAAAGAAGA AGGTGGACGT CACACTCCAT TCTTCAACAA 650 

CTACCGTCCA CAATTCTACT TCCGTACTAC TGACGTTACA GGTTCAATCG 700 

55 AACTTCCAGC AGGTACTGAA ATGGTAATGC CTGGTGATAA C GTGAC AATC 750 

GACGTTGAGT TGATTCACCC AATCGCCGTA GAACAAG 7 87 



60 2 ) INFORMATION FOR SEQ ID NO: 972 

743 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 35401 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 972 



TAGTTGCTGC GACTG AC GGC CCGATGCCGC AGACTCGTGA GCACATCCTG 50 

mggtcgtc AGGTAGGCGT TCCGTACATC ATCGTGTTCC TGAACAAATG 100 

CgScaTOOTT GATGACGAAG AGCTGCTGGA ACTGGTTGAA ATGGAAGTTC 150 

GTGAACTTCT GTCTCAGTAC GACTTCCCGG GCGACGACAC TCCGATCGTT 200 

20 CGTGGTTCTG CTCTGAAAGC GCTGGAAGGC GACGCAGAGT GGGAAGCGAA 250 

AATCCTGGAA CTGGCTGGCT TCCTGGATTC TTACATTCCG GAACCAGAGC 300 

GTGCGATTGA CAAGCCGTTC CTGCTGCCGA TCGAAGACGT ATTCTCCATC 350 

T^GTCGTG GTACCGTTGT TACCGGTCGT GTAGAACGCG GTATCATCAA 400 

AGTTGGTGAA GAAGTTGAAA TCGTTGGTAT CAAAGAGACT CAGAAGTCTA 450 

25 CCTGTACTGG CGTTGAAATG TTCCGCAAAC TGCTGGACGA AGGCCGTGCT 500 

GGTGAGAACG TAGGTGTTCT GCTGCGTGGT ATCAAACGTG AAGAAATC G A 550 

acctSgt^ag StaItggcta AGCCGGGCAC CATCAAGCCG CACACCAAGT 60 0 

TCGAATCTGA AGTGTACATT CTGTCCAAAG ATGAAGGCGG CCGTCATACT 650 

CCGTTCTTCA AAGGCTACCG TCCGCAGTTC TACTTCCGTA CTACTGACGT 700 

3 0 GACTGGTACC ATCGAACTGC CGGAAGGCGT AGAGATGGTA ATGCCGGGCG 7 50 

acaScaa AATGGTTGTT ACCCTGATCC ACCCGATCGC GATGGACGAC 800 
GGT 



35 



45 



2 ) INFORMATION FOR SEQ ID NO: 973 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 762 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 43895 



50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 973 

ACTGACGGCC CGATGCCGCA GACTCGTGAG CACATCCTGC TGGGTCGTCA 50 

GGTAGGCGTT CCGTACATCA TCGTGTTCCT GAACAAATGC GACATGGTTG 100 

ATGACGAAGA GCTGCTGGAA CTGGTTGAAA TGGAAGTTCG TGAACTTCTG 150 

55 TCTCAGTACG ACTTCCCGGG CGACGACACT CCGATCGTTC GTGGTTC TGC 200 

" TCTGAAAGCG CTGGAAGGCG ACGCAGAGTG GGAAGCGAAA ATCCTGGAAC 2 50 

TGGCTGGCTT CCTGGATTCC TACATTCCGG AACCAGAGCG TGC RATTG AC 300 

AAGC CGTTCC TGCTGCCGAT CGAAGACGTA TTCTCCATCT CCGGTCGTGG 350 

TACCGTTGTT ACCGGTCGTG TAGAACGCGG TATCATCAAA GTTGGTGAAG 400 

60 AAGTTGAAAT C GTTGGT ATC AAAGAGACTC AGAAGTCTAC CTGTACTGGC 450 

744 



CA 02307010 2000-05-19 



GTTGAAATGT TCCGCAAACT GCTGGACGAA GGCCGTGCTG GTGAGAACGT 5 00 

AGGTGTTCTG CTGCGTGGTA TCAAACGTGA AGAAATCGAA CGTGGTCAGG 55 0 

TACTGGCTAA GCCGGGCACC ATCAAGCCGC ACACCAAGTT CGAATCTGAA 6 00 

GTGTACATTC TGTCCAAAGA TGAAGGCGGC CGTCATACTC CGTTCTTCAA 650 

AGGCTACCGT CCGCAGTTCT ACTTCCGTAC TACTGACGTG AC TGGT AC C A 7 00 

TTGAACTGCC GGAAGGCGTA GAGATGGTAA TGCCGGGCGA CAACATCAAA 7 50 

ATGGTTGTTA CC 7 62 



10 

2 ) INFORMATION FOR SEQ ID NO: 974 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 804 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 11775 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 974 

CCTGGTAGTT GCTGCGACTG ACGGCCCGAT GC CGC AG ACT CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TTGAAATGGA 150 

3 0 AGTTCGTGAA CTTCTGTCTC AGTACGACTT CCCGGGCGAC GACACTCCGA 2 00 

TCGTTCGTGG TTC TGCTCTG AAAGCGCTGG AAGGCGACGC AGAGTGGGAA 250 

GCGAAAATCC TGGAACTGGC TGGCTTCCTG GATTCCTACA TTCCGGAACC 300 

AGAGCGTGCG ATTGACAAGC CGTTCCTGCT GCCGATCGAA GACGTATTCT 350 

CCATCTCCGG TCGTGGT AC C GTTGTTACCG GTCGTGTAGA ACGCGGTATC 400 

3 5 ATCAAAGTTG GTGAAGAAGT TGAAATCGTT GGTATCAAAG AGACTCAGAA 45 0 

GTCTACCTGT ACTGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC 500 

GTGCTGGTGA GAACGTAGGT GTTCTGCTGC GCGGTATCAA ACGTGAAGAA 550 

ATCGAACGTG GTCAGGTACT GGCTAAGCCG GGCACCATCA AGCCGCACAC 600 

CAAGTTCGAA TCTGAAGTGT ACATTCTGTC CAAAGATGAA GGCGGTCGTC 650 

40 ATACTCCGTT CTTCAAAGGC TACCGTCCGC AGTTCTACTT CCGTACTACT 7 00 

GACGTGACTG GTACCATCGA ACTGCCGGAA GGCGTAGAGA TGGTAATGCC 750 

GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCG ATCGCGATGG 800 

ACGA 804 

45 

2 ) INFORMATION FOR SEQ ID NO: 975 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 25922 

60 

745 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97 5 

GCGATCCTGG TAGTTGCTGC GACTGACGGC CCGATGCCGC AGACTCGTGA 50 

GCACATCCTG CTGGGTCGTC AGGTAGGCGT TCCGTACATC ATCGTGTTCC 100 

c TGAACAAATG CGACATGGTT GATGACGAAG AGCTGCTGGA ACTGGTTGAA 150 

a?5Saa^tc GTGAACTTCT GTCTCAGTAC GACTTCCCGG GCGACGACAC 200 

TCCGATCGTT CGTGGTTCTG CTCTGAAAGC GCTGGAAGGC GACGCAGAGT 250 

GGGAAGCGAA AATCCTGGAA CTGGCTGGCT TCCTGGATTC YTACATTCCG 300 

GAACCAGAGC GTGCGATTGA CAAGCCGTTC CTGCTGCCGA TCGAAGACGT 350 

10 ATTCTCCATC TCCGGTCGTG GTACCGTTGT TACCGGTCGT GTAGAACGCG 400 

GTATCATCAA AGTTGGTGAA GAAGTTGAAA TCGTTGGTAT CAAAGAGACT 450 

CAGAAGTCTA CCTGTACTGG CGTTGAAATG TTCCGCAAAC TGCTGGACGA 500 

AGGCCGTGCT GGTGAGAACG TAGGTGTTCT GCTGCGTGGT ATCAAACGTG 550 

AAGAAATCGA ACGTGGTCAG GTACTGGCTA AGCCGGGCAC CATCAAGCCG 600 

15 CACACCAAGT TCGAATCTGA AGTGTACATT CTGTCCAAAG ATGAAGGCGG 650 

TCGTCATACT CCGTTCTTCA AAGGCTACCG TCCGCAGTTC TACTTCCGTA 700 

CTACTGACGT GACTGGTACC ATCGAACTGC CGGAAGGTGT AGAGATGGTA 750 

ATGCCGGGCG ACAACATCAA AATGGTTGTT ACCCTGATCC ACCCGATCGC 800 
GATG 



20 



2) INFORMATION FOR SEQ ID NO : 97 6 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium avium 
35 (B) STRAIN: Mavi-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97 6 

GGCGCGATCC TGGTGGTCGC CGCCACCGAC GGC CCGATGC CGCAGACCCG 50 

40 TGAGCACGTG CTGCTCGCCC GTCAGGTCGG TGTGCCCTAC ATCCTGGTCG 100 

CCCTGAACAA GGCCGACATG GTCGACGACG AGGAGCTCCT CGAGCTCGTC 150 

GAGATGGAGG TCCGCGAGCT GCTGGCCGCC CAGGAGTTCG ACGAGGACGC 2 00 

CCCGGTGGTG CGGGTCTCGG CGCTCAAGGC GCTCGAGGGC GACGCCAAGT 250 

GGGTCGAGTC CGTCGAGCAG CTGATGGAGG CCGTCGACGA GTCGATCCCG 300 

45 GACCCGGTCC GCGAGACGGA GAAGCCGTTC CTGATGCCGG TGGAGGACGT 350 

CTTCACCATC ACCGGGCGTG GCACCGTGGT CACCGGTCGT GTCGAGCGCG 400 

GTGTGATCAA CGTGAACGAG GAAGTCGAGA TCGTCGGCAT CCGCCCGACC 450 

AGCACCAAGA CCACGGTCAC CGGTGTGGAR ATGTTCCGCA AGCTGCTCGA 500 

CCAGGGCCAG GCCGGTGACA AC GTCGGTCT GCTGCTGCGT GGTATCAAGC 550 

50 GTGAGGACGT CGAGCGCGGC CAGGTCGTGA CCAAGCCCGG CACCACCACG 600 

CCGCACACCG AGTTCGAGGG CCAGGTCTAC ATCCTGTCCA AGGACGAGGG 650 

CGGCCGGCAC ACGCCGTTCT TCAACAACTA CCGTCCGCAG TTCTACTTCC 700 

GCACCACCGA CGTGACCGGT GTGGTG AC GC TGCCGGAGGG CACCGAGATG 750 

GTGATGCCCG GTGACAACAC CAACATCTCG GTGAAGCTGA TCCAGCCCGT 800 

55 CGCCATGGAC GACGGTSTGC GGTTC 825 



2 ) INFORMATION FOR SEQ ID NO: 977 

60 

746 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 977 

15 TATCCTTGTA GTAGCTTCAA CTGACGGACC AATGCCACAA ACTCGTGAGC 50 

ACATCCTTCT TTCACGTCAG GTTGGTGTTA AACACCTTAT CGTCTTCATG 100 

AACAAAGTTG ACTTGGTTGA CGACGAAGAA TTGCTTGAAT TGGTTGAAAT 150 

GGAAATCCGT GACCTATTGT C AG AAT AC G A CTTCCCAGGT GACGATCTTC 200 

CAGTTATCCA AGGTTCAGCA CTTAAAGCTC TTGAAGGTGA CTCTAAATAC 2 50 

2 0 GAAGACATCG TTATGGAATT GATGAACACA GTTGATGAGT ATATCCCAGA 3 00 

AC C AGAACGT GACACTGACA AACCATTGCT TCTTCCAGTC GAGGAC GT AT 3 50 

TCTCAATCAC TGGACGTGGT ACAGTTGCTT CAGGACGTAT CGACCGTGGT 400 

ATCGTTAAAG TCAACGACGA AATCGAAATT GTTGGTATCA AAGAAGAAAC 450 

TCAAAAAGCA GTTGTTACTG GTGTTGAAAT GTTCCGTAAA CAACTTGACG 5 00 

25 AAGGTCTTGC TGGAGATAAC GTAGGTGTCC TTCTTCGTGG TGTTCAACGT 550 

GATGAAATCG AACGTGGACA AGTTATCGCT AAACCAGGTT CAATCAACCC 600 

ACACACTAAA TTCAAAGGTG AAGTCTACAT CCTTACTAAA GAAGAAGGTG 65 0 

GACGTCACAC TCCATTCTTC AACAACTACC GTCCACAATT CTACTTCCGT 700 

ACTACTGACG TTACAGGTTC AATCGAACTT CCAGCAGGTA CTGAAATGGT 750 

3 0 AATGC CTGGT GATAACGTGA CAATCGACGT TGAGTTGATT CACCCAATCG 800 

CCGTAGAACA AGGTACTACA 820 



35 2 ) INFORMATION FOR SEQ ID NO: 97 8 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium gordonae 

(B) STRAIN: M-Gor-1 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 978 



GGCGCGATCC TGGTGGTCGC CGCCACCGAT GGCCCGATGC CGCAGACCCG 50 

TGAGCACGTG CTGCTCGCCC GTCAGGTGGG CGTGCCCTAC ATCCTGGTGG 100 

CGCTGAACAA GTCCGACGCG GTCGACGACG AGGAGCTGCT CGAGCTCGTC 150 

GAGCTGGAGG TCCGCGAGTT GCTGGCCGCC CAGGACTTCG ACGAGGAAGC 2 00 

55 TCCGGTGGTC CGGGTCTCGG CGCTGAAGGC GCTCGAGGGC GACGCCACCT 25 0 

GGGTGAAGTC GGTAGAGGAC TTGATGGACG CGGTCGACGA GTCGATTCCG 3 00 

GACCCGGTCC GCG AC AC CG A CAAGCCGTTC CTGATGCCCG TCGAGGACGT 3 50 

CTTCACCATC ACCGGTCGTG GCACCGTCGT CACCGGCCGT GTGGAGCGCG 400 

GCGTGGTGAA CGTGAACGAG GAAGTCGAGA TCGTCGGCAT CAAGCCGACC 450 

60 AGCACCAAGA CCACGGTCAC CGGTGTGGAG ATGTTCCGCA AGCTGCTCGA 500 

747 
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C C AGGGTC AG GCCGGTGACA ACGTCGGTCT 
GTGAGGACGT CGAGCGCGGC CAGGTCGTCA 
CCGCACACCG AGTTCGAGGG TCAGGTCTAC 
CGGCCGGCAC ACGCCGTTCT TCAACAACTA 
5 GCACCACCGA CGTGACCGGT GTGGTGACGC 
GTGATGCCCG GTG AC AAC AC CAACATCTCG 
CGCCATGGAC GACGGTCTGC GG 



GCTGCTGCGT GGTGTCAAGC 550 

TCAAGCCCGG CACCACCACT 600 

ATCCTGTCCA AGGACGAGGG 650 

CCGTCCGCAG TTCTACTTCC 7 00 

TGCCGGAGGG CACCGAAATG 750 

GTGAAGCTGA TCCAGCCCGT 800 

822 



10 



2 ) INFORMATION FOR SEQ ID NO: 979 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 821 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 



20 



35 



40 



45 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 979 



30 



GCCGTAGAAC AAGGTACTAC A 



2) INFORMATION FOR SEQ ID NO: 980 



ID NO: 979 






CAATGCCACA 


AACTC GTG AG 


50 


AAACACCTTA 


TCGTCTTCAT 


100 


ATTGC TTGAA 


TTGGTTGAAA 


150 


ACTTCCCAGG 


TGACGATCTT 


200 


CTTGAAGGTG 


"ACTCTAAATA 


250 


AGTTGATGAG 


TATATCCCAG 


300 


TTCTTCCAGT 


CGAGGACGTA 


350 


TCAGGACGTA 


TCGACCGTGG 


400 


CGTTGGTATC 


AAAGAAGAAA 


450 


TGTTCCGTAA 


ACAACTTGAC 


500 


CTTCTTCGTG 


GTGTTCAACG 


550 


TAAACCAGGT 


TCAATCAACC 


600 


TCCTTACTAA 


AGAAGAAGGT 


650 


CGTCCACAAT 


TCTACTTCCG 


700 


TCCAGCAGGT 


ACTGAAATGG 


750 


TTGAGTTGAT 


TCACCCAATC 


800 






821 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: ATCC 25177 

60 

748 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 980 

GGTGCGATCC TGGTGGTCGC CGCCACCGAC GGCCCGATGC CCCAGACCCG 5 0 

CGAGCACGTT CTGCTGGCGC GTCAAGTGGG TGTGCCCTAC ATCCTGGTAG 100 

5 CGCTGAACAA GGCCGACGCA GTGGACGACG AGGAGCTGCT CGAACTCGTC 150 

GAGATGGAGG TCCGCGAGCT GCTGGCTGCC CAGGAATTCG ACGAGGACGC 2 00 

CCCGGTTGTG CGGGTCTCGG CGCTCAAGGC GCTCGAGGGT GACGCGAAGT 2 50 

GGGTTGCCTC TGTCGAGGAA CTGATGAACG CGGTCGACGA GTCGATTCCG 3 00 

GACCCGGTCC GCGAGACCGA CAAGCCGTTC CTGATGCCGG TCGAGGACGT 3 50 

10 CTTCACCATT ACCGGCCGCG GAACCGTGGT CACCGGACGT GTGGAGCGCG 4 00 

GCGTGATCAA CGTGAACGAG GAAGTTGAGA TCGTCGGCAT TCGCCCATCG 450 

ACCACCAAGA CCACCGTCAC CGGTGTGGAG ATGTTCCGCA AGCTGCTCGA 5 00 

CCAGGGCCAG GCGGGCGACA ACGTTGGTTT GCTGCTGCGG GGCGTCAAGC 550 

GCGAGGACGT CGAGCGTGGC CAGGTTGTCA CCAAGCCCGG CACCACCACG 600 

15 CCGCACACCG AGTTCGAAGG CCAGGTCTAC ATCCTGTCCA AGGACGAGGG 650 

CGGCCGGCAC ACGCCGTTCT TCAACAACTA CCGTCCGCAG TTCTACTTCC 7 00 

GCACCACCGA CGTGACCGGT GTGGTGACAC TGCCGGAGGG CACCGAGATG 7 50 

GTGATGCCCG GTGACAACAC CAACATCTCG GTGAAGTTGA TCCAGCCCGT 800 

CGCCATGGAC GAAGGTNTGC GTTTCGCG 828 

20 

2 ) INFORMATION FOR SEQ ID NO : 981 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 



30 



(ii) MOLECULE - TYPE : Genomic DNA 



<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 
35 (B) STRAIN: CSG 144 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 981 

CGGCGGTATC TTAGTAGTAT CTGCTGCAGA TGGCCCAATG CCACAAACTC 50 

40 GTGAACACAT CTTATTATCA CGTAACGTTG GTGTACCAGC TTTAGTTGTA 100 

TTCTTAAACA AAGC TGAC AT GGTTGACGAC GAAGAATTAT TAGAATTAGT 15 0 

TGAAATGGAA GTTCGTGACT TATTATCTGA ATACGACTTC CCTGGTGACG 2 00 

ATGTACCAGT TATCGTTGGT TCTGCATTAA AAGCTTTAGA AGGCGACCCA 250 

GAATACGAAC AAAAAATCTT AGACTTAATG CAAGCTGTAG ATGACTACAT 3 00 

45 CCCAACTCCA GAACGTGACT CTGATAAACC ATTCATGATG CCAGTTGAGG 3 50 

ACGTATTCTC AATCACTGGT CGTGGTACTG TAGCAACAGG CCGTGTTGAA 400 

CGTGGTCAAA TCAAAGTCGG TGAAGAAGTT GAAATCATCG GTATCACTGA 45 0 

AGAAAGCAAG AAAACAACAG TTACAGGTGT AGAAATGTTC CGTAAATTAT 500 

TAGACTACGC TGAAGCTGGT GACAACATCG GTGCTTTATT ACGTGGTGTT 550 

50 GCACGTGAAG ACGTACAACG TGGACAAGTA TTAGCAGCTC CTGGCTCTAT 600 

TACTCCACAC ACAAAATTCA AAGCTGATGT TTACGTTTTA TCTAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTC AC TA ACTACCGCCC ACAATTCTAC 7 00 

TTCCGTACTA CTGACGTAAC TGGCGTTGTT CACTTACCAG AAGGTAC TG A 7 50 

AATGGTTATG CCTGGCGATA ACGTAGAAAT GACTGTTGAA TTAATCGCTC 800 

5 5 CAATCGCGAT TGAAGACGG 819 



60 



2 ) INFORMATION FOR SEQ ID NO: 982 

749 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: ( t 

10 (A) ORGANISM: Streptococcus mitis 

(B) STRAIN: LSPQ 2583 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 982 

15 CTATCCTTGT AGTAGCTTCA ACTGACGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA GGTTGGTGTT AAACACCTTA TCGTCTTCAT 100 

GAACAAGATC G AC TTGGTTG ATGACGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATCCG TGACCTCTTG TCAGAATACG ACTTCCCAGG TGACGATCTT 200 

CCAGTTATCC AAGGTTCAGC TCTTAAAGCT CTTGAAGGTG ATACTAAGTA 2 50 

2 0 CGAAGACATC ATCATGGAAT TGATGAACAC TGTTGATGAG TACATCCCAG 300 

" AACCAGAACG TGATAC TG AC AAACCTCTTC TTCTTCCAGT CGAAGACGTA 3 50 

TTCTCAATCA CTGGTCGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 4 00 

TACTGTTCGT GTCAACGATG AAATCGAAAT CGTTGGTATC AAAGAAGAAA 450 

TCCAAAAAGC AGTTGTT AC T GGTGTTGAAA TGTTCCGTAA ACAGCTTGAC 500 

25 GAAGGTCTTG CAGGGGACAA CGTAGGTGTA CTTCTTCGTG GTATCCAACG 550 

TGATGAAATC GAACGTGGTC AAGTTATCGC TAAACCAGGT TCAATCAACC 600 

CACACACTAA ATTCAAGGGT GAAGTTTACA TCCTTACTAA AGAAGAAGGT 650 

GGACGTCACA CTCCATTCTT CAACAACTAC CGTCCACAGT TCTACTTCCG 700 

TACAACTGAC GTTACAGGTT CAATCGAACT TCCAGCAGGT ACTGAAATGG 750 

30 TAATGCCTGG TGATAACGTA ACTATCGACG TTGAGTTGAT CCACCCAATC 800 
GC CGTTGAAC AAGG 



35 2 ) INFORMATION FOR SEQ ID NO: 983 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: bases 

(B) TYPE: 810 Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mltis 

(B) STRAIN: ATCC 49456 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 83 



CTATCCTTGT AGTAGCTTCA ACTGACGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA GGTTGGTGTT AAACACCTTA TCGTCTTCAT 100 

GAACAAAGTT GACTTGGTTG ACGACGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATCCG TG AC CTATTG TCAGAATACG ACTTCCCAGG TGACGATCTT 2 00 

55 CCAGTTATCC AAGGTTCAGC TCTTAAAGCC CTTGAAGGTG ACACTAAATA 250 

CGAAGACATC GTTATGGAAT TGATGAACAC AGTTGATGAG TACATCCCAG 300 

AACCAGAACG TGACACTGAC AAACCATTGC TTCTTCCAGT CGAAGACGTA 350 

TTCTCAATCA CTGGTCGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400 

TATCGTTAAA GTCAACGACG AAATCGAAAT CGTTGGTATC AAAGAAGAAA 45 0 

60 CTCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

750 
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GAAGGTCTTG CCGGAGATAA TGTAGGTGTC CTTCTTCGTG GTGTTCAACG 55 0 

TGATGAAATC GAACGTGGAC AAGTTATTGC TAAACCAGGT TCAATCAACC 600 

CACACACTAA ATTCAAAGGT GAAGTTTACA TCCTTACTAA AGAAGAAGGT 65 0 

GGACGTCACA CTCCATTCTT CAACAACTAC CGTCCACAAT TCTACTTCCG 7 00 

TACTACTGAC GTTACAGGTT CAATCGAACT TCCAGCAGGT ACTGAAATGG 75 0 

TAATGCCTGG TGATAACGTG ACAATCGACG TTGAGTTGAT CCACCCAATC 800 

GCCGTAGAAC 810 



10 

2 ) INFORMATION FOR SEQ ID NO: 9 84 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 817 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitls 

(B) STRAIN: ATCC 903 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 984 

TCCTTGTAGT AGCTTCAACT GACGGACCAA TGCCACAAAC TCGTGAGCAC 50 

ATCCTTCTTT CACGTCAGGT TGGTGTTAAA CACCTTATCG TCTTCATGAA 100 

CAAGATCGAC TTGGTTGATG ACGAAGAATT GCTTGAATTG GTTGAAATGG 150 

30 AAATCCGTGA CCTCTTGTCA GAATACGACT TCCCAGGTGA CGATCTTCCA 2 00 

GTTATCCAAG GTTCAGCTCT TAAAGC TCTT GAAGGTGATA C TAAGTACG A 2 50 

AGACATCATC ATGGAATTGA TGAACACTGT TGATGAGTAC ATCCCAGAAC 3 00 

CAGAACGTGA TACTGACAAA CCTCTTCTTC TTCCAGTCGA AG AC GT ATTC 3 50 

TCAATCACTG GTCGTGGTAC AGTTGCTTCA GGACGTATCG AC CGTGGTAC 400 

3 5 TGTTCGTGTC AACGATGAAA TCGAAATCGT TGGTATCAAA GAAGAAATCC 450 

AAAAAGCAGT TGTTACTGGT GTTGAAATGT TC CGTAAAC A GCTTGACGAA 500 

GGTCTTGCAG GGGACAACGT AGGTGT AC TT CTTCGTGGTA TC C AACGTG A 55 0 

TGAAATCGAA CGTGGTCAAG TTATCGCTAA AC C AGGTTC A ATCAACCCAC 600 

ACACTAAATT CAAGGGTGAA GTTTACATCC TTACTAAAGA AGAAGGTGGA 650 

40 CGTCACACTC CATTCTTCAA CAACTACCGT CCACAGTTCT AC TTC CGTAC 7 00 

AACTGACGTT AC AGGTTC AA TCGAACTTCC AGCAGGTACT GAAATGGTAA 7 50 

TGCCTGGTGA TAACGTAACT ATCGACGTTG AGTTGATCCA CCCAATCGCC 800 

GTTGAACAAG GTACTAC 817 



45 



2 ) INFORMATION FOR SEQ ID NO: 985 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 



60 



751 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 985 



GACGGACCAA 
TGGTGTTAAA 
5 ACGAAGAATT 
GAATACGACT 
TAAAGCTCTT 
TGAACACTGT 
CCATTGCTTC 

10 AGTTGCTTCA 
TCGAAATCGT 
GTTGAAATGT 
AGGTGTCCTT 
TTATCGCTAA 

15 GTCTACATCC 
CAACTACCGT 
TCGAACTTCC 
ATCGACGTTG 



TGCCACAAAC 

CACCTTATTG 

GCTTGAATTG 

TCCCAGGTGA 

GAAGGTGACT 

TGATGAGTAC 

TTCCAGTCGA 

GGACGTATCG 

TGGTATCAAA 

TCCGTAAACA 

CTTCGTGGTG 

ACCAGGTTCA 

TTACTAAAGA 

CCACAATTCT 

TGCAGGTACT 

AGTTGATCCA 



TCGTGAGCAC 

TCTTCATGAA 

GTTGAAATGG 

CGATCTTCCA 

CTAAATACGA 

ATCCCAGAAC 

AG AC GTATTC 

ACCGTGGTAC 

GAAGAAACTC 

ACTTGACGAA 

TTCAACGTGA 

ATCAACCCAC 

AGAAGGTGGA 

ACTTCCGTAC 

GAAATGGTAA 

CCCAATCGCC 



ATCCTTCTTT 

C AAAATTG AC 

AAATCCGTGA 

GTTATCCAAG 

AGACATCATT 

CAGAACGTGA 

TCAATCACTG 

TGTTCGTGTC 

AAAAAGCAGT 

GGTCTTGCCG 

CGAAATCGAA 

ACACTAAATT 

CGTCACACTC 

TACTGACGTT 

TGCCTGGTGA 

GTAGAACAAG 



CACGTCAGGT 50 

TTGGTAGACG 100 

CCTCTTGTCA 150 

GTTCAGCTCT 200 

ATGGAATTGA 250 

CACTGAAAAA 300 

GACGTGGTAC 350 

AACGACGAAA 400 

TGTTACTGGT 450 

G AG AT AAC GT 500 

CGTGGACAAG 550 

TAAAGGTGAA 600 

CATTCTTCAA 650 

ACAGGTTCAA 7 00 

TAACGTGACT 7 50 

GTACTACA 798 



20 



25 



30 



35 



40 



45 



50 



55 



2 ) INFORMATION FOR SEQ ID NO: 986 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 815 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 986 



TTGTAGTAGC 
CTTCTTTCAC 
AGTTGACTTG 
TCCGTGACCT 
ATCCAAGGTT 
CATCGTTATG 
AACGTGACAC 
ATCACTGGAC 
TAAAGTCAAC 
AAGCAGTTGT 
CTTGCTGGAG 
AATCGAACGT 
CTAAATTCAA 
CACACTCCAT 
TGACGTTACA 
CTGGTGATAA 
GAACAAGGTA 



TTCAACTGAC 
GTCAGGTTGG 
GTTGACGACG 
ATTGTCAGAA 
CAGCACTTAA 
GAATTGATGA 
TGACAAACCA 
GTGGTACAGT 
GACGAAATCG 
TACTGGTGTT 
ATAACGTAGG 
GGACAAGTTA 
AGGTGAAGTC 
TCTTCAACAA 
GGTTCAATCG 
CGTGACAATC 
CTACA 



GG AC CAATGC 
TGTTAAACAC 
AAGAATTGCT 
TACGACTTCC 
AGCTCTTGAA 
ACACAGTTGA 
TTGCTTCTTC 
TGCTTCAGGA 
AAATCGTTGG 
GAAATGTTCC 
TGTCCTTCTT 
TCGCTAAACC 
TACATCCTTA 
CTACCGTCCA 
AACTTCCAGC 
GACGTTGAGT 



CACAAACTCG 
CTTATCGTCT 
TGAATTGGTT 
CAGGTGACGA 
GGTGACTCTA 
TGAGTATATT 
CAGTCGAGGA 
CGTATCGACC 
TATCAAAGAA 
GTAAACAACT 
CGTGGTGTTC 
AGGTTCAATC 
CTAAAGAAGA 
CAATTCTACT 
AGGTACTGAA 
TGATTCACCC 



TGAGCACATC 50 

TCATGAACAA 100 

GAAATGG AAA 150 

TCTTCCAGTT 200 

AATACGAAGA 250 

CCAGAACCAG 300 

C GTATTC TC A 3 50 

GTGGTATCGT 400 

GAAACTCAAA 450 

TGACGAAGGT 500 

AACGTGATGA 550 

AACCCACACA 600 

AGGTGGACGT 650 

TCCGTACTAC 7 00 

ATGGTAATGC 75 0 

AATCGCCGTA 800 
815 



60 



2 ) INFORMATION FOR SEQ ID NO: 987 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: jEntero coccus hlrae 
10 (B) STRAIN: ATCC 8043 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 987 

CGGACAATCT TGGTTGTTTC TGCTACGGAT GGCCCAATGC CTCAAACTCG 5 0 

15 TGAACACATT TTACTTTCTC GCCAAGTAGG CGTGAAATAT TTGATTGTTT 100 

TCTTGAACAA AACAGATTTA GTCGATGATG AAGAATTAAT TGATCTAGTA 150 

GAAATGGAAG TTCGTGAACT ATTAAGCGAA TATGGATTTC CAGGTGATGA 200 

TACACCTGTT ATCAAAGGAT CAGCATTAAA AGCATTACAA GGTGATCCTG 250 

ATGCAGAAGC AGCTATCATG GAATTGATGG ACACAGTCGA TGAATACATC 3 00 

20 CCAACACCAG AACGTGATAC GGACAAACCA TTATTGTTAC CCGCCGAAGA 3 50 

TGTATTTTCA ATTAC TGG AC GTGGAACAGT AGCATCTGGT CGGATTGACC 400 

GTGGGGCTGT TAGAGTCGGT GACGAAATCG AAATC GTAGG GATCAAACCA 450 

GAAACGCAAA GAGCCGTAGT AACAGGAGTT GAAATGTTCC GCAAAACGCT 500 

TGATTACGGT GAAGCAGGGG ATAACGTAGG TGTGTTATTA CGTGGGATTC 550 

25 AAAGAGAAGA CATCGAACGT GGCCAAGTGA TTGCCAAACC TGGTTCAATT 600 

ACACCTCATA CTAAATTCAA AGCAGAAGTT TAGGTTTTGA CTAAAGAAGA 650 

AGGCGGACGT CATACACCAT TCTTCAATAA TTATCG AC C A CAATTTTATT 7 00 

TCCGCACAAC TGACGTAACA GGAACAATTG TTTTGCCAGA AGGAACGGAA 750 

ATGGTCATGC CTGGCGACAA CGTAACGATC GATGTAGAAT TGATTCATCC 800 

3 0 TGTTGCTATT GAAAACGGGA CGACTTTCTC CA 832 



35 



2) INFORMATION FOR SEQ ID NO: 988 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Enterococcus mundtii 

(B) STRAIN: ATCC 43186 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 988 

50 TGGTGCGATC TTAGTTGTTT CTGCGACAGA TGGACCAATG CCTCAAACTC 5 0 

GCGAGCATAT C CTTTTATC A CGTCAAGTTG GTGTAAAATA TTTGATTGTA 100 

TTTTTGAATA AAGTTGATTT GGTCGATGAT GAAGAATTGA TCGATCTTGT 150 

AGAAATGGAA GTTCGTGAAT TACTGAATGA AT ATGG TTTC CCAGGTGATG 200 

ACACACCTGT CATCAAAGGC TCTGCATTGA AAGCATTGCA AGGTGACCCA 250 

55 GAAGCAGAAG CTGCGATCAA TGAATTGATG GAAACAGTGG ATGACTATAT 3 00 

CCCAACACCA GAACGTGATA CCGACAAACC ATTGCTTTTA CCAGTTGAAG 350 

ATGTTTTCTC AATC AC TGGT CGCGGAACGG TAGCATCTGG TCGTATCGAC 400 

CGTGGAGCCG TTCGTGTCGG TGATG AAATC GAGATCATCG GAATCAAACC 450 

TGAAACGAAA AAAGCGGTTG TGACAGGGGT AGAGATGTTC CGTAAAACGT 500 

60 TAGATTATGG CGAAGCTGGA GATAACGTAG GAATCTTGTT ACGTGGTATC 550 
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CAAAGAGAAG ATATTGAACG 
CACACCACAT ACAAAATTCA 
AAGGCGGACG TCATACACCA 
TTCCGCACAA CAGATGTAAC 
5 AATGGTCATG CCTGGAGACA 
CAGTGGCAAT CGAACAAGGA 



TGGACAAGTA ATTGCGAAAC 
AAGCGGAAGT TTATGTATTG 
TTCTTCAATA ACTACCGCCC 
AGGTACGATC GTGTTGCCAG 
ACGTAACCAT CGAGGTAGAG 
ACGACTTTCT CTATT 



CTGGTTCAAT 6 00 

ACGAAAGAAG 650 
ACAATTTTAT 700 
AAGGAACAGA 7 50 

TTGATCCATC 800 
835 



10 2 ) INFORMATION FOR SEQ ID NO: 989 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus raffinosus 

(B) STRAIN: ATCC 49427 



25 



30 



35 



40 



(xi) SEQUENCE DESCRIPTION: 



ID NO: 989 






ACCAATGCCT 


CAAACACGTG 


50 


TTAAATACTT 


GATCGTCTTC 


100 


GAATTGATTG 


ATTTAGTTGA 


150 


TGGTTTCCCA 


GGCGATGATA 


200 


C TTT AGAAGG 


CGATCCTGAA 


250 


"ACGGTTGACG 


AATATATCCC 


300 


C TTGTTACC A 


GTGGAAGATG 


350 


CATCTGGTCG 


TATTGATCGT 


400 


ATTATCGGGA 


TCAAACCTGA 


450 


AATGTTCCGT 


AAAACATTGG 


500 


TTCTATTACG 


TGGTATTACA 


550 


GCTAAACCAG 


GTTCAATTAC 


600 


TGTGTTGACG 


AAAGAAGAAG 


650 


ATCGTCCTCA 


ATTCTACTTC 


700 


TTGCCAGAAG 


GTACTGAAAT 


750 


CGTTGAATTA 


ATCC ATCC AA 


800 


TT 




832 



45 

2) INFORMATION FOR SEQ ID NO : 990 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 154 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracls 

(B) STRAIN: CIP 9440 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 990 

754 
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GGATCCTGTA TATGCACAAA AACTAGGTGT 
TATCACAACC TGATACAGGG GAGCAAGGTT 
GTACGAAGTG GTGCGGTTGA TATTATCGTA 
5 TGTA 



TAACATCGAT G AATTAC TAT 50 
TAGAAATCGC AGAAGCACTT 100 
ATTGACTCTG TAGCAGCTCT 150 

154 
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2 ) INFORMATION FOR SEQ ID NO : 991 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 ( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

2 0 (A) ORGANISM: Prevotella melaninogenica 

(B) STRAIN: ATCC 25845 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 991 

25 GCCATTGCAG AGGCACAGAA GCAGGGCGGT ATTGCAGCCT TCATTGATGC 50 

TGAGCACGCC TTCGACCGTT TCTATGCAGA GAAGTTAGGT GTGGATGTTG 100 

AT AAC CTTTG GGTTTCACAG CCAGACAATG GTGAGCAGGC TTTAGAGATT 150 

GCCGACCAGC TGATTCGCTC TTCCGCTATT GACATTCTCG TTGTCGACTC 2 00 

AGTTGCAGCC TTGACTCCAA AGAAGGAGAT TGAGGGTGAC ATGGGTGACT 250 

3 0 CTGCAGTAGG TTTACAAGCA CGACTGATGA GTCAGGCATT GCGTAAACTT 3 00 

ACCTCAACAA TCGCAAAAAC TAATAC TTGC TGCATCTTCA TCAACCAGTT 350 
GCGTGAGAAG ATTGGTGTGA TGTTTGGTAA TCCA 384 



35 



45 



2 ) INFORMATION FOR SEQ ID NO: 992 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 624 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: R760 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 992 

ACACGTGAAC AC ATC TTGTT ATCACGTAAC GTTGGTGTAC CATACATCGT 50 

TGTTTTC TTA AACAAAATGG ATATGGTTGA TGACGAAGAA TT AC TAG AAT 100 

TAGTTGAAAT GGAAGTTCGT GAC TTATTGT CAGAATATGA CTTCCCAGGC 150 

55 GACGATGTTC CTGTAATCGC TGGTTCTGCT TTGAAAGCTC TTGAAGGCGA 2 00 

TGCTTCATAC GAAGAAAAAA TCATGGAATT AATGGCTGCA GTTGACGAAT 2 50 

ACGTTCCAAC TCCAGAACGT GAC AC TG AC A AACCATTCAT GATGCCAGTC 3 00 

GAAGACGTAT TCTCAATCAC TGGACGTGGT ACTGTTGCTA CAGGCCGTGT 3 50 

TGAACGTGGA CAAGTTCGCG TTGGTGACGA AGTTGAAATC GTTGGTATTG 400 

60 CTGAAGAAAC TGC TAAAAC A ACTGTAACTG GTGTTGAAAT GTTC C GT AAA 450 
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TTGTTAG AC T ATGCTGAAGC AGGGGATAAC 
TGTTGCTCGT GAAGACATCC AACGTGGACA 
CAATCACACC TCATACAAAA TTTAAAGCTG 
GAAGAAGGTG GACGTCATAC ACCA 



ATTGGTGCAT TGCTACGTGG 5 00 

AGTATTGGCT AAAGCTGGTA 550 
AAGTTTACGT TTTAACAAAA 600 

624 



2 ) INFORMATION FOR SEQ ID NO: 993 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 756 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



15 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 
20 (C) ACCESSION NUMBER: extracted from U40453 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 993 

ATGGAAAACA ATAAAAAAGT ATTGAAGAAA ATGGTATTTT TTGTTTTAGT 50 

2 5 GACATTTCTT GGACTAACAA TCTCGCAAGA GGTATTTGCT CAACAAGACC 100 

CCGATCCAAG CCAACTTCAC AGATCTAGTT TAGTTAAAAA CCTTCAAAAT 150 

ATATATTTTC TTTATGAGGG TGACCCTGTT ACTCACGAGA ATGTGAAATC 2 00 

TGTTGATCAA CTTTTATCTC ACGATTTAAT ATATAATGTT TCAGGGCCAA 25 0 

ATTATGATAA ATTAAAAACT GAACTTAAGA ACCAAGAGAT GGCAACTTTA 3 00 

3 0 TTTAAGGATA AAAACGTTGA TATTTATGGT GTAGAATATT ACCATCTCTG 350 

TTATTTATGT G AAAATGC AG AAAGGAGTGC ATGTATCTAC GGAGGGGTAA 40 0 

CAAATCATGA AGGGAATCAT TTAGAAATTC CTAAAAAGAT AGTC G TT AAA 45 0 

GTATCAATCG ATGGTATCCA AAGCCTATCA TTTGATATTG AAACAAATAA 500 

AAAAATGGTA ACTGCTCAAG AATTAGACTA TAAAGTTAGA AAATATCTTA 55 0 

3 5 CAGATAATAA GCAACTATAT ACTAATGGAC CTTCTAAATA TGAAACTGGA 600 

TATATAAAGT TCATACCTAA GAATAAAGAA AGTTTTTGGT TTGATTTTTT 65 0 

CCCTGAACCA GAATTTACTC AATCTAAATA TCTTATGATA TATAAAGATA 7 00 

ATGAAACGCT TGACTCAAAC AC AAGC C AAA TTGAAGTCTA CCTAACAACC 7 50 

AAGTAA /Db 

40 



2 ) INFORMATION FOR SEQ ID NO: 994 

45 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 994 
55 TGG AC T AAC A ATCTCGCAAG AGG 23 



2 ) INFORMATION FOR SEQ ID NO : 995 

60 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 995 
ACATTCTCGT GAGTAACAGG GT 22 



10 



15 2 ) INFORMATION FOR SEQ ID NO: 996 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

2 0 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 996 

ACAAATCATG AAGGGAATCA TTTAG 25 

30 

2 ) INFORMATION FOR SEQ ID NO: 997 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



40 



45 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 997 
CTAATTCTTG AGCAGTTACC ATT 23 

2 ) INFORMATION FOR SEQ ID NO : 998 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 998 
GGAGGGGTAA CAAATCATGA AGG 23 

60 
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2 ) INFORMATION FOR SEQ ID NO: 999 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

10 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 999 
15 TTGACCTTGT TGATGACGAA GAG 



2 ) INFORMATION FOR SEQ ID NO: 1000 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1000 
TTAGTGTGTG UGTTGATTGA ACT 



35 2 ) INFORMATION FOR SEQ ID NO: 1001 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100 

AAGAGTTGCT TGAATTAGTT GAG 



2 ) INFORMATION FOR SEQ ID NO : 1002 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 894 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

60 

758 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 

(B) STRAIN: ATCC 700294 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1002 

AACATGATCA CTGGTGCCGC TCAAATGGAC GGAGCTATCC TTGTAGTTGC 50 

TTCAACTGAT GGACCAATGC CACAAACTCG TGAGCACATC CTTCTTTCAC 100 

GTCAGGTTGG TGTTAAACAC CTTATCGTGT TCATGAACAA AGTTGACCTT 150 

10 GTTGATGACG AAGAGTTGCT TGAATTAGTT GAGATGGAAA TTCGTGACCT 2 00 

TCTTTCAGAA TACGATTTCC CAGGTGATGA CCTTCCAGTT ATCCAAGGTT 2 50 

CAGCTCTTAA AGCTCTTGAA GGCGACACTA AATTTGAAGA CATCATCATG 3 00 

GAATTGATGG ATACTGTTGA TTCATACATT CCAGAACCAG AACGCGACAC 350 

TGACAAACCA TTGCTTCTTC CAGTCGAAGA CGTATTCTCA ATTACAGGTC 4 00 

15 GTGGTACAGT TGCTTCAGGA CGTATCGACC GTGGTACTGT TCGTGTCAAC 45 0 

GACGAAATCG AAATCGTTGG TATCAAAGAA GAAACTAAAA AAGCTGTTGT 5 00 

TACTGGTGTT GAAATGTTCC GTAAACAACT TGACGAAGGT CTTGCAGGAG 550 

ACAACGTAGG TATCCTTCTT CGTGGTGTTC AACGTGACGA AATCGAACGT 600 

GGTCAAGTTA TTGC TAAACC AAGTTCAATC AACCCACACA CTAAATTCAA 650 

2 0 AGGTGAAGTA TATATCCTTT CTAAAGACGA AGGTGGACGT CACACTCCAT 700 

TCTTCAACAA CTACCGTCCA CAATTCTACT TCCGTACAAC TGACGTAACA 75 0 

GGTTCAATCG AACTTCCAGC AGGTACAGAA ATGGTTATGC CTGGTGATAA 800 

CGTGACAATC AACGTTGAGT TGATCCACCC AATCGCCGTA GAACAAGGTA 850 

CTACTTTCTC AATC CGTGAA GGTGGACGTA CTGTTGGTTC AGGT 894 

25 



2 ) INFORMATION FOR SEQ ID NO: 1003 

3 0 _(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 32 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 
40 (B) STRAIN: ATCC 14579 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1003 

AATGGATCCT GTATATGCAC AAAAATTAGG CGTTAACATA GATGAATTAC 5 0 

45 TATTATCACA GCC TGAT AC A GGGGAGCAAG GATTAGAAAT CGCGGAAGCA 100 

CTTGTACGAA GTGGTGCGGT TGACATTATC GTAATTGACT CTGTAGCAGC 150 

TCTTGTACCG AAAGCAGAGA TTGAAGGCGA CATGGGTGAC TCACACGTAG 2 00 

GTTTACAAGC ACGTTTAATG TCACAAGCAC TTCGTAAGCT TTCAGGAGCA 250 

ATCAACAAAT CAAAAACAAT TGCAATC TTT ATTAACCAAA TTCGTGAAAA 300 

50 AGTTGGGGTT ATGTTCGGAA ACCCAGAAAC AA 332 



2 ) INFORMATION FOR SEQ ID NO: 1004 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1212 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 

759 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: streptococcus pneumoniae 

(B) STRAIN: StrR-01 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1004 

10 ACCAAGAAGC TCAAAAACAT CTGTGGGATA TCTACAACTC CGATCAATAC 50 

GTCTCTTACC CTGACGATGA TTTGCAAGTC GCATCTACGG TCGTAGATGT 100 

ttcaaa-Eggt AAAGTCATCG CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 150 

ACGTTTCATT TGGTACCAAC CAAGCTGTGG AAACCAATCG TGACTGGGGT 200 

aaIcaatcac CGATTATGCA CCTGCCATAG AATACGGTGT 250 

m TTATGATTCC ACTGCAACTA TGGTTAATGA TATTCCTTAT AACTATCCGG 3 00 

ga^SKgcac ACCTGTCTAC AACTGGGATA GAGCATATTT CGGTAATATT 350 

ACTCTGCAAT ATGCTC TTC A ACAATCACGA AATGTCACAG CCGTTGAGAC 400 

SgStaag GTCGGTCTAG ATAGAGCTAA AACCTTCCTT AATGGTCTTG 450 

GTATCGACTA TCCAAGCATG CATTATGCAA ACGCCATTTC AAGTAATACA 500 

20 aSgaatcta ATAAACAATA CGGAGCAAGT AGTGAAAAAA TGGCTGCTGC 550 

Jtatgctgcc TTTGCAAATG GTGGCACTTA CTATAAACCA ATGTATATCC 600 

ATAAAGTCGT CTTCAGTGAT GGAAGTAAAA AAGAGTTCTC TAATGTCGGA 650 

JcTCCTGCCA TGAAGGAAAC GACAGCCTAT ATGATGACCG ACATGATGAA 700 

AACAGTCTTG ACTTATGGAA CTGGGCGTGG AGCCTATCTT CCTTGGCTTC 750 

25 CTCAAGCTGG TAAAACAGGA ACCTCTAACT ATACAGATGA GGAAGTTGAA 800 

AACCACATCA AGAACACTGG CTATGTAGCT CCAGATGAAA TGTTTGTTGG 850 

TTATACTCGT AAGTATTCTA TGGCTGTATG GACAGGTTAT TCGAATCGTT 900 

TAACTCCTAT CGTTGGAGAT GGTTTC CTAG TTGCAGCTAA AGTTTATCGC 950 

TCAATGATAA CGTATCTATC AGAAGATACT CATC C AG AAG ACTGGACGAT 1000 

30 GCCAGACGGA CTTTTCAGAA ACGGGGAATT TGTATTCAAA AATGGAGCTC 1050 

3 gccuSSatg GACTGAACCC tctactcaac aatcctcaac agctgaaagt 1100 

TCAAGCTCAT CATCAGATAG TTCAACTTCA CAGTCTAGCT CAACCACTCC 1150 
AAGCACAAAT AATAGTACGA CTACCAATCC TAACAATAAT ACGCAACAAT 1200 
CAAATACAAC CC 

35 

2 ) INFORMATION FOR SEQ ID NO: 1005 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1212 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
50 (B) STRAIN: StrR-02 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1005 

AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

55 TACCCTGACG ATGATTTGCA AGTCGCATCT AC GGTC GTAG ATGTTTCAAA 100 

TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA AGTAACGTTT 150 

CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG GGGTTCTGCT 200 

ATGAAACCAA TCACCGATTA TGCACCTGCC ATAGAATACG GTGTTTATGA 250 

TTCCACTGCA ACTATGGTTA ATGATATTCC TTATAACTAT CCGGGAACAA 300 

60 GCACACCTGT CTACAACTGG GATAGAGCAT ATTTCGGTAA TATTACTCTG 350 

760 
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CAATATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG AGACTTTGAA 4 00 

TAAGGTCGGT C T AGAT AG AG CTAAAACCTT CCTTAATGGT CTTGGTATCG 45 0 

ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA TACAACAGAA 5 00 

TCTAATAAAC AATACGGAGC AAGTAGTGAA AAAATGGCTG CTGCTTATGC 55 0 

5 TGCCTTTGCC AACGGTGGAA TTTACCACAA ACCCATGTAT ATCAATAAGG 600 

TCGTCTTCAG TGACGGTAGT AAAAAAGAAT TTTCAGATGT AGGTACACGA 650 

GCTATGAAAG AAACAACTGC TTACATGATG AC CG AAATG A TGAAAACTGT 700 

CTTGGCATAC GGAACTGGTC GTGGAGC CTA TCTCCCATGG TTAGCGCAAG 750 

CTGGTAAGAC AGGTACTTCT AACTACACAG ATGATGAAAT TGAAAAACAC 800 

10 ATCAAGAACA CTGGCTATGT AGCTCCAGAT GAAATGTTTG TTGGTTATAC 85 0 

TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTCGAAT CGTTTAACTC 900 

CTATCGTTGG AGATGGTTTC CTAGTTGCAG CTAAAGTTTA TCGCTCAATG 950 

ATAACGTATC TGTCTGAAGG AAGCAATCCA GAGGATTGGA ATATACCAGA 1000 

GGGGCTCTAC AGAAATGGAG AATTC GTATT TAAAAATGGT GCTCGTTCTA 1050 

15 CGTGGAGCTC ACCTGCTCCA CAACAACCCC CATCAACTGA AAGTTCAAGC 1100 

TCATCATCAG ATAGTTCAAC TTCACAGTCT AACTCAACCA CTCCAAGCAC 1150 

AAATAATAGT ACG ACT AC C A ATCCTAACAA TAATACGCAA CAATCAAATA 12 00 
CAACCCCTGA TC .1212 



20 



35 



2) INFORMATION FOR SEQ ID NO: 1006 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 1213 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-03 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1006 



ACCAAGAAGC TCAAAAACAT CTGTGGGATA TC TAC AAC TC CGATCAATAC 50 

GTCTCTTACC CTGACGATGA TTTGCAAGTC GCATCTACGG TCGTAGATGT 100 

40 TTCAAATGGT AAAGTCATCG CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 150 

ACGTTTCATT TGGTACCAAC CAAGCTGTGG AAACCAATCG TGACTGGGGT 200 

TCTGCTATGA AACCAATCAC CGATTATGCA CCTGCCATAG AAT AC GGTGT 250 

TTATGATTCC ACTGCAACTA TGGTTAATGA TATTCCTTAT AACTATCCGG 300 

GAACAAGCAC ACCTGTCTAC AACTGGGATA GAGCATATTT CGGTAATATT 350 

45 ACTCTGCAAT ATGCTCTTCA ACAATCACGA AATGTCACAG CCGTTGAGAC 400 

TTTGAATAAG GTCGGTC TAG ATAGAGCTAA AACCTTCCTT AATGGTC TTG 450 

GTATCGACTA TCCAAGCATG CATTATGCAA ACGCCATTTC AAGTAATACA 500 

ACAGAATCTA ATAAACAATA CGGAGCAAGT AGTGAAAAAA TGGCTGCTGC 550 

TTATGCTGCC TTTGCCAACG GTGGAATTTA CCACAAACCC ATGTATATCA 600 

50 ATAAGGTCGT CTTCAGTGAC GGTAGTAAAA AAGAATTTTC AGATGTAGGT 650 

ACACGAGCTA TGAAAGAAAC AACTGCTTAC ATGATGACCG AAATGATGAA 700 

AACTGTCTTG GCATACGGAA CTGGTCGTGG AGCCTATCTC CCATGGTTAG 750 

CGCAAGCTGG TAAGACAGGT ACTTCTAACT ACACAGATGA TGAAATTGAA 800 

AAACACATCA AGAACACTGG CTATGTAGCT CCAGATGAAA TGTTTGTTGG 850 

55 TTATAC TCGT AAGTATTCTA TGGCTGTATG GACAGGTTAT TCGAATCGTT 9 00 

TAACTCCTAT CGTTGGAGAT GGTTTCCTAG TTGCAGCTAA AGTTTATCGC 950 

TCAATGATAA CGTATCTGTC TGAAGGAAGC AATCCAGAGG ATTGGAATAT 1000 

ACCAGAGGGG CTCTACAGAA ATGGAGAATT C GTATTT AAA AATGGTGCTC 1050 

GTTCTACGTG GAGCTCACCT GCTCCACAAC AACCCCCATC AACTGAAAGT 1100 

60 TCAAGCTCAT CATCAGATAG TTCAACTTCA CAGTCTAACT CAACCACTCC 1150 
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AAGCACAAAT AATAGTACGA CTACCAATCC TAACAATAAT ACGCAACAAT 1200 
CAAATACAAC CCC 



10 



15 



2 ) INFORMATION FOR SEQ ID NO: 1007 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1218 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-04 

2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1007 

ACCAAGAAGC TCAAAAACAT CTGTGGGATA TCTACAACTC CGATCAATAC 50 

GTCTCTTACC CTGACGATGA TTTGCAAGTC GCATCTACGG TCGTAGATGT 100 

TTCAAATGGT AAAGTCATCG CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 150 

2 5 ACGTTTCATT TGGTACCAAC CAAGCTGTGG AAACCAATCG TGAC TGGGGT 2 00 

tc?gSatga AACCAATCAC CGATTATGCA CCTGCCATAG AATACGGTGT 250 

TTATGATTCC ACTGCAACTA TGGTTAATGA TATTCCTTAT AACTATCCGG 300 

GAACAAGCAC ACCTGTCTAC AACTGGGATA GAGCATATTT CGGTAATATT 350 

ACTCTGCAAT ATGCTCTTCA AC AATC AC G A AATGTCACAG CCGTTGAGAC 400 

30 TTTGAATAAG GTCGGTCTAG AT AGAGC TAA AACCTTCCTT AATGGTCTTG 450 

GTATCGACTA TCCAAGCATG CATTATGCAA ACGCCATTTC AAGTAATACA 500 

ACAGAATCTA ATAAACAATA CGGAGCAAGT AGTGAAAAAA TGGCTGCTGC 550 

TTATGCTGCC TTTGCAAATG GTGGCACTTA CTATAAACCA ATGTATATCC 600 

ATAAAGTCGT CTTCAGTGAT GGAAGTAAAA AAGAGTTCTC TAATGTCGGA 650 

35 ACTCGTGCCA TGAAGGAAAC GACAGCCTAT ATGATGACCG ACATGATGAA 700 

AACAGTCTTG ACTTATGGAA CTGGGCGTGG AGCCTATCTT CCTTGGCTTC 750 

CTCAAGCTGG TAAAACAGGA ACCTCTAACT ATACAGATGA GGAAGTTGAA 800 

AACCACATCA AGAACACTGG CTATGTAGCT CCAGATGAAA TGTTTGTTGG 850 

TTATACTCGT AAGTATTCTA TGGCTGTATG GACAGGTTAT TCGAATCGTT 900 

40 TAACTCCTAT CGTTGGAGAT GGTTTCCTAG TTGCAGCTAA AGTTTATCGC 950 

TCAATGATAA CGTATCTATC AGAAGATACT CATC C AG AAG ACTGGACGAT 1000 

GCCAGACGGA CTTTTCAGAA ACGGGGAATT TGTATTCAAA AATGGAGCTC 1050 

GCCCAATATG GACTGAACCC TCTACTCAAC AATCCTCAAC AGCTGAAAGT 1100 

TCAAGCTCAT CATCAGATAG TTCAACTTCA CAGTCTAGCT CAACCACTCC 1150 

45 AAGCACAAAT AATAGTACGA CTACCAATCC TAACAATAAT ACGCAACAAT 1200 

CAAATACAAC CCCTGATC 12 1B 



50 2 ) INFORMATION FOR SEQ ID NO: 1008 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1223 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

762 
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(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1008 

5 

GTAGACCAAG AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA 50 

ATACGTCTCT TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG 100 

ATGTTTCAAA TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA 150 

AGTAACGTTT CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG 2 00 

10 GGGTTCTGCT ATGAAACCAA TC AC C G ATT A TGCACCTGCC ATAGAATACG 250 

GTGTTTATGA TTCCACTGCA AC TATGGTT A ATGATATTCC TTATAACTAT 300 

CCGGGAACAA GCACACCTGT CTACAACTGG GATAGAGCAT ATTTCGGTAA 3 50 

TATTACTCTG CAATATGCTC TTCAACAATC AC G AAATGTC ACAGCCGTTG 400 

AGACTTTGAA TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT 450 

15 CTTGGTATCG ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA 500 

TACAACAGAA TCTAATAAAC AATACGGAGC AAGTAGTGAA AAAATGGCTG 550 

CTGCTTATGC TGCCTTTGCA AATGGTGG C A CTTACTATAA ACCAATGTAT 600 

ATCCATAAAG TCGTCTTCAG TGATGGAAGT AAAAAAGAGT TCTCTAATGT 650 

CGGAACTCGT GCCATGAAGG AAACGACAGC CTATATGATG ACCGACATGA 7 00 

2 0 TGAAAACAGT CTTGACTTAT GGAACTGGGC GTGGAGCCTA TCTTCCTTGG 7 50 

CTTCCTCAAG CTGGTAAAAC AGGAACCTCT AAC TAT AC AG ATGAGGAAGT 800 

TGAAAAC C AC ATCAAGAACA CTGGCTATGT AGCTCCAGAT GAAATGTTTG 850 

TTGGTTATAC TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTCGAAT 900 

CGTTTAACTC CTATCGTTGG AGATGGTTTC CTAGTTGCAG CTAAAGTTTA 950 

25 TCGCTCAATG AT AAC GTATC TATCAGAAGA TACTCATCCA GAAGACTGGA 1000 

CGATGCCAGA CGGACTTTTC AGAAACGGGG AATTTGTATT CAAAAATGGA 1050 

GCTCGCCCAA TATGGACTGA ACCCTCTACT CAACAATCCT CAACAGCTGA 1100 

AAGTTCAAGC TCATCATCAG ATAGTTCAAC TTCACAGTCT AGCTCAACCA 1150 

CTCCAAGCAC AAATAATAGT ACG AC T AC C A ATCCTAACAA TAATACGCAA 12 00 

3 0 CAATCAAATA CAACCCCTGA TCA 1223 



2 ) INFORMATION FOR SEQ ID NO: 1009 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1214 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1009 

50 ACCAAGAAGC TCAAAAACAT CTGTGGGATA TCTACAACTC CGATCAATAC 50 

GTCTCTTACC CTGACGATGA TTTGCAAGTC GCATCTACGG TCGTAGATGT 100 

TTCAAATGGT AAAGTCATCG CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 150 

ACGTTTCATT TGGTACCAAC CAAGCTGTGG AAACCAATCG TGACTGGGGT 200 

TCTGCTATGA AACCAATCAC CGATTATGCA CCTGCCATAG AAT AC GGTGT 25 0 

55 TTATGATTCC ACTGCAACTA TGGTTAATGA TATTCCTTAT AACTATCCGG 300 

GAACAAGCAC ACCTGTCTAC AACTGGGATA GAGCATATTT CGGTAATATT 35 0 

ACTCTGCAAT ATGCTCTTCA ACAATCACGA AATGTCACAG CCGTTGAGAC 400 

TTTGAATAAG GTCGGTCTAG ATAGAGCTAA AACCTTCCTT AATGGTCTTG 450 

GTATCGACTA TCCAAGCATG CATTATGCAA ACGCCATTTC AAGTAATACA 500 

60 ACAGAATCTA ATAAACAATA CGGAGCAAGT AGTGAAAAAA TGGCTGCTGC 550 
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TTATGCTGCC TTTGCAAATG GTGGCACTTA 
ATAAAGTCGT CTTCAGTGAT GGAAGTAAAA 
ACTCGTGCCA TGAAGGAAAC GACAGCCTAT 
AACAGTCTTG ACTTATGGAA CTGGGCGTGG 
5 CTCAAGCTGG TAAAACAGGA ACCTCTAACT 
AACCACATCA AG AAC AC TGG CTATGTAGCT 
TTATACTCGT AAGTATTCTA TGGCTGTATG 
TAACTCCTAT CGTTGGAGAT GGTTTCCTAG 
TCAATGATAA CGTATC TATC AGAAGATACT 
10 GCCAGACGGA CTTTTCAGAA ACGGGGAATT 
GCCCAATATG GACTGAACCC TCTACTCAAC 
TCAAGCTCAT CATCAGATAG TTCAACTTCA 
AAGCACAAAT AATAGTACGA CTACCAATCC 
CAAATACAAC CCCT 

15 



CTATAAACCA ATGTATATCC 600 

AAGAGTTCTC TAATGTCGGA 650 

ATGATGACCG ACATGATGAA 7 00 

AGCCTATCTT CCTTGGCTTC 7 50 
ATACAGATGA GGAAGTTGAA 800 
C C AG ATG AAA TGTTTGTTGG 850 
GACAGGTTAT TCGAATCGTT 9 00 

TTGCAGCTAA AGTTTATCGC 950 

CATCCAGAAG ACTGGACGAT 1000 

TGTATTC AAA AATGGAGCTC 1050 

AATCCTCAAC AGCTGAAAGT 1100 

CAGTCTAGCT CAACCACTCC 1150 

TAACAATAAT ACGCAACAAT 12 00 

1214 



2) INFORMATION FOR SEQ ID NO: 1010 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1223 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



60 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
30 (B) STRAIN: StrR-07 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1010 

CCAAGAAGCT CAAAAACATC TGTGGGATAT CTACAACTCC GATCAATACG 50 

3 5 TCTCTTACCC TGACGATGAT TTGCAAGTCG CATCTACGGT CGTAGATGTT 100 

"~ TCAAATGGTA AAGTCATCGC ACAACTTGGT GCTCGTCATC AAGCAAGTAA 150 

TGTTTCATTC GGTACCAACC AGGCCGTAGA AACCAATCGT GACTGGGGAT 200 

CATCAATGAA ACCAATCACT GACTATGCTC CC GCTTT AG A ATATGGAGTC 250 

TATGACTCTA CTGCTTCTAT TGTACATGAT GTCCCTTATA ACTATCCTGG 3 00 

40 CACTGATACT CCACTCTACA AC TGGG ATC A TGTCTACTTT GGAAACATTA 3 50 

CAATCCAGTA TGCTCTTCAA CAATCACGAA ATGTCACAGC CGTTGAGACT 400 

TTGAATAAGG TCGGTCTAGA TAG AGC TAAA ACCTTCCTTA ATGGTCTTGG 450 

TATCGACTAT CCAAGCATGC ATTATGCAAA CGCCATTTCA AGTAACACAA 500 

CTGAATCCAA CAAAAAATAT GGTGCAAGTA GTGAAAAAAT GGCTGCTGCC 550 
45 TACGCTGCTT TTGCTAATGG TGGTATTTAT CACAAACCAA TGTATATCAA 600 
TAAAATC GTC TTTAGTGATG GTAGCGAAAA AGAATTTTCT GATGCTGGTA 650 
CACGAGCTAT GAAAGAGACT AC TGC CTAT A TGATGACTGA AATGATGAAA 700 
ACTGTTTTAA CTTAC GGAAC AGGACGTGGA GCCTACCTAC CATGGCTTCC 7 50 

ACAAGCAGGT AAGACAGGTA CTTCTAACTA TAC TGACGAA GAAATTGAAA 800 
50 AGTATATCAA GAACACTGGT TACGTAGCTC CAGATGAAAT GTTTGTAGGG 850 
TATACCCGTA AATATGCAAT GGCTGTTTGG ACAGGATACT CAAATCGTCT 900 
AACTCCAATC ATCGGAGATG GTTTCCTTGT TGCTGGTAAA GTCTATCGTT 950 

CAATGATAAC TTACCTTTCT GAAGATGACC AACCTGGAGA TTGGACAATG 1000 

C C AG ATGGCT TGTATAGAAA TGGAGAATTC GTATTTAAAA ATGGTGCTCG 1050 

55 TTCTACGTGG AGCTCACCTG CTCCACAACA ACCCCCATCA AC TG AAAGTT 1100 

CAAGCTCATC ATCAGATAGT TCAACTTCAC AGTCTAACTC AACCACTCCA 1150 

AGCACAAATA ATAGTACGAC TACCAATCCT AACAATAATA CGCAACAATC 1200 

AAATACAACC CCTGATCAAC AAA 1223 



764 
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2 ) INFORMATION FOR SEQ ID NO: 1011 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 1207 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-08 



15 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1011 



AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

20 TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA AGTAACGTTT 150 

CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG GGGTTCTGCT 2 00 

ATGAAACCAA TCACCGATTA TGCACCTGCC ATAGAATACG GTGTTTATGA 250 

TTCCACTGCA AC TATGGTT A ATGATATTCC TTATAAC TAT CCGGGAACAA 3 00 

GCACACCTGT C T AC AACTGG GATAGAGCAT ATTTCGGTAA TATTACTCTG 3 50 

25 CAATATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG AG AC TTTG AA 400 

TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT CTTGGTATCG 450 

ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA TACAACAGAA 5 00 

TCTAATAAAC AATACGGAGC AAGTAGTGAA AAAATGGCTG CTGCTTATGC 550 

TGCC TTTGC A AATGGTGGCA CTTACTATAA AC C AATGTAT ATCCATAAAG 600 

3 0 TCGTCTTCAG TGATGGAAGT AAAAAAGAGT TCTCTAATGT CGGAACTCGT 650 

GCCATGAAGG AAACGACAGC CTATATGATG AC CGAC ATG A TGAAAACAGT 7 00 

CTTGACTTAT GGAACTGGGC GTGGAGCCTA TCTTCCTTGG CTTCCTCAAG 750 

CTGGTAAAAC AGGAACCTCT AACTATACAG ATGAGGAAGT TG AAAACC AC 800 

ATCAAGAACA CTGGCTATGT AGCTCCAGAT GAAATGTTTG TTGGTTATAC 85 0 

3 5 TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTCGAAT CGTTTAACTC 900 

CTATCGTTGG AGATGGTTTC CTAGTTGCAG CTAAAGTTTA TCGCTCAATG 950 

ATAACGTATC TATCAGAAGA TACTCATCCA GAAGACTGGA CGATGCCAGA 1000 

CGGACTTTTC AGAAAC GGGG AATTTGTATT CAAAAATGGA GCTCGCCCAA 1050 

TATGGACTGA ACCCTCTACT CAACAATCCT CAACAGCTGA AAGTTCAAGC 1100 

40 TCATCATCAG ATAGTTCAAC TTCACAGTCT AGCTCAACCA CTCCAAGCAC 1150 

AAATAATAGT ACGACTACCA ATCCTAACAA TAATACGCAA CAATCAAATA 12 00 

CAACCCC 12 07 



45 
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2 ) INFORMATION FOR SEQ ID NO: 1012 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1201 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-09 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1012 

765 
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AAGCTCAAAA 
TACCCTGACG 
TGGTAAAGTC 
5 CATTTGGTAC 
ATGAAACCAA 
TTCCACTGCA 
GCACACCTGT 
CAATATGCTC 

10 TAAGGTCGGT 
ACTATCCAAG 
TCTAATAAAC 
TGCCTTTGCC 
TCGTCTTCAG 

15 GCTATGAAAG 
CTTGGCATAC 
CTGGTAAGAC 
ATCAAGAACA 
TCGTAAGTAT 

20 CTATCGTTGG 
ATAACGTATC 
CGGACTTTTC 
CGTGGAACTC 
TCATCATCAG 

25 AAATAATAGT 
C 



ACATCTGTGG 

ATGATTTGCA 

ATCGCCCAAC 

CAACCAAGCT 

TC AC C G ATT A 

AC TATGGTT A 

CTACAACTGG 

TTCAACAATC 

CTAGATAGAG 

CATGCATTAT 

AATACGGAGC 

AACGGTGGAA 

TGACGGTAGT 

AAACAACTGC 

GGAACTGGTC 

AGGTACTTCT 

CTGGCTATGT 

TCTATGGCTG 

AGATGGTTTC 

TATCAGAAGA 

AGAAACGGGG 

ACCTGCTCCA 

ATAGTTCAAC 

ACGACTACCG 



GATATCTACA 

AGTCGCATCT 

TTGGAGCTCG 

GTGGAAACCA 

TGCACCTGCC 

ATGATATTCC 

GATAGAGCAT 

ACGAAATGTC 

CTAAAACCTT 

GCAAACGCCA 

AAGTAGTGAA 

TTTACCACAA 

AAAAAAGAAT 

TTACATGATG 

GTGGAGCCTA 

AACTACACAG 

AGCTCCAGAT 

TATGGACAGG 

C T AGTTGC AG 

TACTCATCCA 

AATTTGTATT 

CAACAACCCC 

TTCACAGTCT 

ATCCTAACAA 



ACTCCGATCA 

ACGGTCGTAG 

TCACCAAGCA 

ATCGTGACTG 

ATAGAATACG 

TTATAACTAT 

ATTTCGGTAA 

ACAGCCGTTG 

CCTTAATGGT 

TTTCAAGTAA 

AAAATGGCTG 

AC C C ATGT AT 

TTTCAGATGT 

AC CG AAATG A 

TCTCCCATGG 

ATGATGAAAT 

GAAATGTTTG 

TTATTCGAAT 

CTAAAGTTTA 

GAAGACTGGA 

CAAAAATGGA 

CATCAACTGA 

AGCTCAACCA 

TAATACGCAA 



ATACGTCTCT 50 

ATGTTTC AAA 100 

AGTAACGTTT 150 

GGGTTCTGCT 2 00 

GTGTTTATGA 2 50 

CCGGGAAC AA 3 00 

TATTACTCTG 350 

AGACTTTGAA 400 

CTTGGTATCG 4 50 

TACAACAGAA 5 00 

CTGCTTATGC 550 

ATCAATAAGG 600 

AGGTACACGA 650 

TGAAAACTGT 7 00 

TTAGCGCAAG 750 

TGAAAAACAC 800 

TTGGTTATAC 850 

CGTTTAACTC 900 

TCGCTCAATG 950 

CGATGCCAGA 1000 

GCTCGTTCTA 105 0 

AAGTTCAAGC 1100 

CTCCAAGCAC 1150 

C AATCAAATA 1200 
1201 



30 2 ) INFORMATION FOR SEQ ID NO: 1013 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 0 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNE S S : Doub 1 e 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE : Genomic DNA 



35 



40 



45 



50 



55 



60 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1013 



TGTAGACCAA GAAGCTCAAA AACATCTGTG GGATATCTAC 
AATACGTCTC TTACCCTGAC GATGATTTGC AAGTCGCATC 
GATGTTTCAA ATGGTAAAGT CATCGCACAA CTTGGTGCTC 
AAGTAATGTT TCATTCGGTA CCAACCAGGC CGTAGAAACC 
GGGGATCATC AATGAAACCA ATCACTGACT ATGCTCCCGC 
GGAGTCTATG ACTC TACTGC TTCTATTGTA CATGATGTCC 
TCCTGGCACT GATACTCCAC TCTACAACTG GGATCATGTC 
ACATTACAAT CCAGTATGCT CTTCAACAAT C AC G AAATGT 
GAGACTTTGA ATAAGGTCGG TC TAG AT AG A GCTAAAACCT 
TCTTGGTATC GACTATCCAA GCATGCATTA TGCAAACGCC 
AC AC AAC TG A ATCCAACAAA AAATATGGTG CAAGTAGTGA 
GCTGCCTACG CTGCTTTTGC TAATGGTGGT ATTTATCACA 
TATCAATAAA ATCGTCTTTA GTGATGGTAG CGAAAAAGAA 
C TGGT AC AC G AGCTATGAAA G AG ACTAC TG CCTATATGAT 
ATGAAAACTG TTTTAACTTA CGGAACAGGA CGTGGAGCCT 

766 



AACTCCGATC 50 

TACGGTCGTA 100 

GTCATCAAGC 150 

AATCGTGACT 200 

TTTAGAATAT 250 

CTTATAACTA 3 00 

T AC TTTGG AA 350 

CACAGCCGTT 4 00 

TCCTTAATGG 450 

ATTTCAAGTA 500 

AAAAATGGCT 550 

AACCAATGTA 600 

TTTTC TGATG 650 

GACTGAAATG 7 00 

AC CTACC ATG 7 50 
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GCTTCCACAA GCAGGTAAGA CAGGTACTTC 
TTGAAAAGTA TATCAAGAAC ACTGGCTACG 
GTGGGTTATA CTCGTAAGTA TTCTATGGCT 
TCGTTTAACT CCTATCGTTG GAGATGGTTT 
5 ATCGCTCTAT GATGACCTAC CTGTCTGAAG 
AATATAC C AG AGGGGCTCTA CAGAAATGGA 
TGCTCGTTCT ACGTGGAGCT CACCTGCTCC 
AAAGTTCAAG CTCATCATCA GATAGTTCAA 
ACTCCAAGCA CAAATAATAG TACGACTACC 
10 ACAATCAAAT ACAACCCCTG 



TAACTATACT GACGAAGAAA 800 

TAGCTCCAGA TGAAATGTTT 850 

GTATGGACAG GTTATTCGAA 9 00 

CCTAGTTGCA GCTAAAGTTT 950 

GAAGCAATCC AGAGGATTGG 10 00 

GAATTCGTAT TTAAAAATGG 105 0 

ACAACAACCC CCATCAACTG 1100 

CTTCACAGTC TAGCTCAACC 1150 

AATCCTAACA ATAATACGCA 12 00 

1220 



2) INFORMATION FOR SEQ ID NO: 1014 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1199 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

2 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1014 

3 0 CAAAAACATC TGTGGGATAT. TTACAATACA GACGAATACG TTGCCTATCC 50 

AGACGATGAA TTGCAAGTCG CTTCTACCAT TGTTGATGTT TC T AAC GGT A 100 

AAGTCATTGC CCAGCTAGGA GCACGCCATC AGTCAAGTAA TGTTTCCTTC 15 0 

GGAATTAACC AAGCAGTAGA AACAAACCGC GACTGGGGAT CAACTATGAA 200 

ACCGATCACA GACTATGCTC CTGCCTTGGA GTACGGTGTC TACGATTCAA 250 

3 5 CTGCTACTAT CGTTC AC GAT GAGCCCTATA ACTACCCTGG GACAAATACC 3 00 

CCTGTTTATA ACTGGGATAG GGGCTACTTT GGCAACATCA CCTTGCAATA 350 

CGCCCTGCAA CAATCGCGAA ACGTCCCAGC CGTGGAAACT CTAAACAAGG 400 

TCGGACTCAA CCGCGCCAAG ACTTTCCTAA ATGGTCTCGG AATCGACTAC 450 

CCAAGTATTC ACTACTCAAA TGCCATTTCA AGTAACACAA CCGAATCAGA 500 

40 CAAAAAATAT GGAGCAAGTA GTGAAAAGAT GGCTGCTGCT TACGCTGCCT 550 

TTGCAAATGG TGGAACTTAC TATAAACCAA TGTATATCCA TAAAGTCGTC 600 

TTTAGTGATG GGAGTGAAAA AG AGTTC TCT AATGTCGGAA CTCGTGCCAT 650 

GAAGGAAACG ACAGCCTATA TGATGACCGA CATGATGAAA ACTGTCTTAG 700 

TATACGGAAT CGGACGTGGA GCCTACCTAC CTTGGCTTCC ACAAGCAGGT 75 0 

45 AAAACAGGTA CTTCTAACTA TACTGACGAA GAAATTGAAA AGTATATCAA 800 

GAACACTGGT T ACGTAGC TC CAGATGAAAT GTTTGTAGGG TATACCCGCA 850 

AATATGCAAT GGCTGTATGG ACAGGCTATT CTAACCGTCT GACACCACTT 900 

GTAGGCGATG GCCTTACGGT CGCTGCTAAA GTTTACCGCT CTATGATGAC 950 

CTACCTGTCT GAAGGAAGCA ATCCAGAGGA TTGGAATATA C C AG AGGGGC 1000 

50 TCTACAGAAA TGGAGAATTC GTATTTAAAA ATGGTGCTCG TTCTACGTGG 105 0 

AACTCACCTG CTCCACAACA ACCCCCATCA ACTGAAAGTT CAAGCTCATC 1100 

ATCAGATAGT TCAACTTCAC AGTCTAGCTC AACCACTCCA AGC AC AAATA 115 0 

ATAGTAC GAC TACCAATCCT AACAATAATA CGCAACAATC AAATACAAC 1199 



55 



2) INFORMATION FOR SEQ ID NO: 1015 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 1211 bases 

767 
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10 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-12 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1015 



AAGCTCAAAA 
TACCCTGACG 
15 TGGTAAAGTC 
CATTTGGTAC 
ATGAAACCAA 
TTCCACTGCA 
GCACACCTGT 

2 0 C AATATGCTC 

TAAGGTCGGT 
ACTATCCAAG 
TCTAATAAAC 
TGCCTTTGCC 
25 TCGTCTTCAG 
GCTATGAAAG 
C TTGGC AT AC 
CTGGTAAGAC 
ATCAAGAACA 

3 0 TC GTAAGT AT 

CT ATC GTTGG 
ATAACGTATC 
GGGGCTCTAC 
CGTGGAGCTC 
3 5 TCATC ATCAG 
AAATAATAGT 
CAACCCCTGA 



ACATCTGTGG 

ATGATTTGCA 

ATCGCCCAAC 

CAACCAAGCT 

TC AC CGATTA 

AC T ATGGTTA 

CTACAACTGG 

TTCAACAATC 

C TAG AT AG AG 

CATGCATTAT 

AATACGGAGC 

AACGGTGGAA 

TGACGGTAGT 

AAACAACTGC 

GGAACTGGTC 

AGGTACTTCT 

CTGGCTATGT 

TCTATGGCTG 

AGATGGTTTC 

TGTCTGAAGG 

AGAAATGGAG 

ACCTGCTCCA 

ATAGTTCAAC 

ACGACTACCA 

T 



GATATCTACA 
AGTCGCATCT 
TTGGAGCTCG 
GTGGAAAC C A 
TGCACCTGCC 
ATGATATTCC 
GATAGAGCAT 
ACGAAATGTC 
CTAAAACCTT 
GCAAACGCCA 
AAGTAGTGAA 
TTTACCACAA 
AAAAAAGAAT 
TTACATGATG 
GTGGAGCCTA 
AACTACACAG 
AGCTCCAGAT 
TATGGACAGG 
CTAGTTGCAG 
AAGCAATCCA 
AATTCGTATT 
CAACAACCCC 
TTCACAGTCT 
ATCCTAACAA 



ACTCCGATCA 

ACGGTCGTAG 

TCACCAAGCA 

ATCGTGACTG 

ATAGAATACG 

TTATAACTAT 

ATTTCGGTAA 

ACAGCCGTTG 

CCTTAATGGT 

TTTCAAGTAA 

AAAATGGCTG 

AC CC ATGTAT 

TTTCAGATGT 

AC CG AAATG A 

TCTCCCATGG 

ATGATGAAAT 

GAAATGTTTG 

TTATTCGAAT 

CTAAAGTTTA 

GAGGATTGGA 

TAAAAATGGT 

CATCAACTGA 

AACTCAACCA 

TAATACGCAA 



ATACGTCTCT 50 

ATGTTTCAAA 100 

AGTAACGTTT 150 

GGGTTCTGCT 200 

GTGTTTATGA 250 

CCGGGAACAA 300 

TATTACTCTG 350 

AGACTTTGAA 400 

CTTGGTATCG 450 

TACAACAGAA 500 

CTGCTTATGC 550 

ATCAATAAGG 600 

AGGTACACGA 65 0 

TGAAAACTGT 7 00 

TTAGCGCAAG 750 

TGAAAAACAC 800 

TTGGTTATAC 850 

CGTTTAACTC 900 

TCGCTCAATG 95 0 

ATATACCAGA 1000 

GCTCGTTCTA 1050 

AAGTTCAAGC 1100 

CTCC AAGC AC 1150 

C AATC AAATA 12 00 
1211 



40 

2 ) INFORMATION FOR SEQ ID NO: 1016 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1222 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-13 



55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1016 

GTAGACCAAG AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA 50 

ATACGTCTCT TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG 100 

ATGTTTCAAA TGGTAAAGTC ATCGCACAAC TTGGTGCTCG TCATCAAGCA 150 

60 AGTAATGTTT CATTCGGTAC CAACCAGGCC GTAGAAACCA ATCGTGACTG 200 

768 
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GGGATCATCA ATGAAACCAA TCACTGACTA TGCTCCCGCT TTAGAATATG 250 

GAGTCTATGA CTCTACTGCT TCTATTGTAC ATGATGTCCC TTATAACTAT 300 

CCTGGCACTG ATACTCCACT CTACAACTGG GATCATGTCT ACTTTGGAAA 350 

CATTACAATC CAGTATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG 400 

5 AGACTTTGAA TAAGGTCGGT C TAG AT AG AG CTAAAACCTT CCTTAATGGT 450 

CTTGGTATCG ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA 5 00 

CACAACTGAA TCCAACAAAA AATATGGTGC AAGTAGTGAA AAAATGGCTG 55 0 

CTGCCTACGC TGCTTTTGCT AATGGTGGTA TTTATCACAA ACCAATGTAT 600 

ATCAATAAAA TCGTCTTTAG TGATGGTAGC GAAAAAGAAT TTTCTGATGC 650 

10 TGGTACACGA GCTATGAAAG AGACTACTGC CTATATGATG ACTGAAATGA 7 00 

TGAAAACTGT TTTAACTTAC GGAACAGGAC GTGGAGCCTA CCTACCATGG 750 

CTTCCACAAG CAGGTAAGAC AGGTACTTCT AACTATAC TG ACGAAGAAAT 800 

TGAAAAGTAT ATCAAGAACA CTGGCTACGT AGCTCCAGAT GAAATGTTTG 850 

TGGGTTATAC TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTC GAAT 900 

15 CGTTTAACTC CTATCGTTGG AGATGGTTTC CTAGTTGCAG CTAAAGTTTA 950 

TCGCTCTATG ATGACCTACC TGTCTGAAGG AAGCAATCCA GAGGATTGGA 1000 

ATATACCAGA GGGGCTCTAC AGAAATGGAG AATTCGTATT TAAAAATGGT 1050 

GCTCGTTCTA CGTGGAGCTC ACCTGCTCCA CAACAACCCC CATCAACTGA 1100 

AAGTTCAAGC TCATCATCAG ATAGTTCAAC TTCACAGTCT AGCTCAACCA 1150 

2 0 CTCCAAGCAC AAATAATAGT ACGACTACCA ATCCTAACAA TAATACGCAA 1200 

CAATCAAATA CAACCCCTGA TC 1222 



25 2 ) INFORMATION FOR SEQ ID NO: 1017 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1229 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) "TOPOLOGY: Linear 

<ii) MOLECULE TYPE: Genomic DNA 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-14 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1017 



TGTAGACCAA GAAGCTCAAA AACATCTGTG GGATATTTAC AAT AC AG AC G 50 

AATACGTTGC CTATCCAGAC GATGAATTGC AAGTCGCTTC TAC C ATTGTT 100 

GATGTTTCTA ACGGTAAAGT CATTGCCCAG CTAGGAGCAC GCCATCAGTC 150 

AAGTAATGTT TCCTTCGGAA TTAACCAAGC AGTAGAAACA AACCGCGACT 200 

45 GGGGATCAAC TATGAAACCG ATCACAGACT ATGCTCCTGC CTTGGAGTAC 250 

GGTGTC TACG ATTCAACTGC TACTATCGTT CACGATGAGC CCTATAACTA 3 00 

CCCTGGGACA AAT ACC C C TG TTTATAACTG GGATAGGGGC TACTTTGGCA 350 

ACATCACCTT GCAATACGCC CTGCAACAAT CGCGAAACGT CCCAGCCGTG 400 

GAAACTCTAA ACAAGGTCGG ACTCAACCGC GCCAAGACTT TCC TAAATGG 450 

50 TCTCGGAATC G ACT ACC C AA GTATTCACTA CTCAAATGCC ATTTCAAGTA 5 00 

AC AC AAC C G A ATC AGAC AAA AAATATGGAG CAAGTAGTGA AAAGATGGCT 55 0 

GCTGCTTACG CTGCCTTTGC AAATGGTGGA ACTTACTATA AAC C AATGT A 600 

TATCCATAAA GTCGTCTTTA GTGATGGGAG TGAAAAAGAG TTCTCTAATG 65 0 

TCGGAACTCG TGCCATGAAA GAAACAACTG CTTACATGAT GACCGAAATG 7 00 

55 ATGAAAACTG TCCTGGCATA CGGAAGTGGT CGTGGAGCCT ATCTCCCATG 75 0 

GTTAGCGCAA GCTGGTAAGA CAGGTACTTC TAACTACACA GATGATGAAA 800 

TTGAAAAACA CATCAAGAAC ACTGGCTATG TAGCTCCAGA TGAAATGTTT 85 0 

GTTGGTTATA CTCGTAAGTA TTCTATGGCT GTATGGACAG GTTATTCGAA 9 00 

TCGTTTAACT CCTATCGTTG GAGATGGTTT CCTAGTTGCA GCTAAAGTTT 950 

60 ATCGCTCAAT GATAACGTAT CTATCAAAAG ATACTCATCC AGAAGACTGG 1000 

769 
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ACGATGCCAG ACGGACTTTT CAGAAACGGG 
AGCTCGTTCT ACGTGGAACT CACCTGCTCC 
AAAGTTCAAG CTCATCATCA GATAGTTCAA 
ACTCCAAGCA CAAATAATAG TACGACTACC 
5 ACAATCAAAT ACAACCCCTG ATCAACAAA 



GAATTTGTAT TCAAAAATGG 1050 
ACAACAACCC CCATCAACTG 1100 
CTTCACAGTC TAACTC AAC C 1150 
AATCCTAACA AT AATAC GC A 12 00 

1229 



10 



2 ) INFORMATION FOR SEQ ID NO: 1018 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1225 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-15 



25 



40 



45 



50 



60 



(xi) SEQUENCE DESCRIPTION: 



30 TGGTGTATAT GATTCCACTG C AAC TATGGT 
ATCCGGGCAC AAGCACACCT GTCTACAACT 
AATATTAGCC TGCAATATGC CCTTCAACAA 
TGAAACACTA AACAAGGTTG GTTTAGATAG 
GTTTGGGAAT TGACTATCCA AGTATTC AC T 

3 5 AATACAACTG AATCTAGTAA AC AGTAC GGG 
TGCGGCTTAC GCTGCATTCG CTAATGGCGG 
ACATCAATAA AGTTGTCTTT AGTGATGGTA 



ATTGAAAAGT ATATCAAGAA CACTGGCTAC 



AACAATCAAA TACAACCCCT GATCA 



ID NO: 1018 






GGGATATCTA 


CAACTCCGAT 


50 


CAAGTCGCAT 


CTACGGTCGT 


100 


AC TTGG AGCT 


CGTCACCAAG 


150 


CTGTGGAAAC 


CAACCGTGAC 


200 


TATGCCCCAG 


CCTTAGAATA 


250 


TAATGATATT 


CCTTATAACT 


300 


GGGATCGAGC 


ATATTTTGGT 


350 


TCTCGTAACG 


TGCCTGCCGT 


400 


AGCCAAAACT 


TTCCTAAATG 


450 


ACTCAAATGC 


TATTTCAAGT 


500 


GCAAGCAGTG 


AGAAAATGGC 


550 


TATTTACCAC 


AAACCAATGT 


600 


GCGAAAAAGA 


ATTTTC TG AT 


650 


GCCTATATGA 


TGACTGAAAT 


700 


ACGTGGAGCC 


TACCTACCAT 


750 


CTAACTATAC 


TGACGACGAA 


800 


GTAGCTCCAG 


ATGAAATGTT 


850 


TGTATGGACT 


GGATACTCAA 


900 


TCCTAGTTGC 


TGCCAAAGTT 


950 


GATGACCATC 


CTGGAGATTG 


1000 


AGAATTCGTA 


TTTAAAAATG 


1050 


CACAACAACC 


CCCATCAACT 


1100 


ACTTCACAGT 


CTAGCTTAAC 


1150 


CAATCCTAAC 


AACAATACGC 


1200 






1225 



2 ) INFORMATION FOR SEQ ID NO: 1019 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1439 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : Doub 1 e 

(D) TOPOLOGY: Linear 



770 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
5 (B) STRAIN: StrR-01 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1019 

GCCTCTATTT CAAAGGAGAT GCCTGGCATT AGTATTTCTA CTTCTTGGGA 50 

10 TCGAAAGGTT TTGGAAACTT CCCTTTCTTC TATAGTAGGG AGTGTATC C A 100 

GTGAAAAAGC TGGTCTCCCA GCGGAAGAAG CAGAATCCTA TCTTAAAAAA 150 

GGCTATTCTC TAAATGACCG TGTTGGAACC TCCTATTTGG AAAAGCAATA 2 00 

TGAAGAGACC TTACAAGGAA AACGCTCGGT AAAAGAAATC CATCTGGATA 250 

AATATGGCAA TATGGAAAGC GTGGACACAA TTGAGGAAGG TAGTAAGGGA 3 00 

15 AACAATATCA AACTGACCAT TGATTTGGCC TTCCAAGATA GCGTGGATGC 3 50 

TTTGCTGAAA AGTTATTTCA ATTCCGAGCT AGGAAATGGT GGAGCTAAAT 4 00 

ATTCTGAAGG TGTCTATGCA GTCGCCCTTA ACCCAAAAAC AGGTGCTGTT 45 0 

TTATCCATGT CAGGGATCAA ACATGACCTG AAAACGGGAG AGTTGACTCC 500 

TGATTCCTTG GGAACGGTAA CCAATGTCTT TGTCCCAGGT TCGGTTGTCA 550 

2 0 AGGC TGCG AC CATCAGCTCA GGTTGGGAAA ATGGTGTTTT ATCAGGAAAC 600 

CAAACCTTAA CAGATCAGCC TATTGTTTTC CAAGGTTCAG CTCCAATTTA 65 0 

TTCTTGGTAT AAATTGGCAT ATGGATCTTT TCCTATTACA GCTGTGGAAG 7 00 

CCTTGGAGTA TTCATCCAAT GCTTACATGG TTCAAACCGC TCTTGGAATC 7 50 

ATGGGCCAGA CCTATCAACC AAATATGTTT GTTGGAAC C A GCAATTTGGA 800 

25 AACAGCTATG GGAAAACTTC GTGCGACCTT TGGCGAATAT GGCTTGGGGG 850 

CTGCGACCGG AATTGACCTA CCAGATGAAT CTACTGGATT TGTTC C C AAA 900 

GAGTATAGCT TTGCTAATTA CATCACCAAT TCCTTTGGGC AGTTTGATAA 950 

CTATACGCCC ATGCAGTTGG CTCAGTATGT AGCAACTATT GCAAATAATG 1000 

GTGTTCGTGT GGCTCCTCGT ATTGTTGAAG GCATTTATGG TAATAATGAT 1050 

3 0 AAGGGAGGAC TGGGTGACTT GATTCAGCAA CTGCAACCGA CAGAGATGAA 1100 

TAAGGTCAAT ATATCCGACT CCGATATGAG CATCTTGCAC CAAGGTTTTT 1150 

ATCAGGTTGC CCATGGTACT AGTGGATTGA CAACTGGACG TGCCTTTTCA 1200 

AATGGTGCCT TGGTATC CAT TAGCGGAAAA ACAGGTACAG CCGAAAGCTA 1250 

TGTGGCAGAT GGTCAGCAAG CAACCAATAC CAATGCGGTG GCCTATGCCC 1300 

3 5 CATCTGATAA TCCCCAAATC GCTGTCGCAG TGGTCTTTCC TCATAATACC 1350 

AATCTAACAA ATGGTGTAGG ACCTTCCATT GCGCGTGACA TTATCAATCT 1400 

GTATCAAAAA T AC CATC C AA TGAACTAGAA AGGAAATTA 143 9 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1020 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1441 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-02 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1020 

GCCTCTATTT CAAAGGAGAT GCCTGGCATT AGTATTTCTA CTTCTTGGGA 50 

TCGAAAGGTT TTGGAAACTT CCCTTTCTTC TATAGTAGGG AGTGTATC C A 100 

GTGAAAAAGC TGGTCTCCCA GCGGAAGAAG CAGAATCCTA TCTTAAAAAA 15 0 

60 GGCTATTCTC TAAATGACCG TGTTGGAACC TCCTATTTGG AAAAGCAATA 200 

771 
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TGAAGAGACC TTACAAGGAA AACGCTCGGT AAAAGAAATC CATCTGGATA 250 

AATATGGCAA TATGGAAAGC GTGGACACAA TTGAGGAAGG TAGTAAGGGA 3 00 

AACAATATCA AACTGACCAT TGATTTGGCC TTCCAAGATA GCGTGGATGC 350 

TTTGC TGAAA AGTTATTTCA ATTCCGAGCT AGGAAATGGT GGAGCTAAAT 400 

5 ATTCTGAAGG TGTCTATGCA GTCGCCCTTA ACCCAAAAAC AGGTGCTGTT 450 

TTATCCATGT CAGGGATCAA ACATGACCTG AAAAC GGG AG AGTTGACTCC 500 

TGATTCCTTG GGAACGGTAA CCAATGTCTT TGTCCCAGGT TCGGTTGTCA 5 50 

AGGCTGCGAC CATCAGCTCA GGTTGGGAAA ATGGTGTTTT ATCAGGAAAC 600 

CAAACCTTAA CAGATCAGCC TATTGTTTTC CAAGGTTCAG CTCCAATTTA 650 

10 TTC TTGGT AT AAATTGGCAT ATGGATCTTT TCCTATTACA GCTGTGGAAG 700 

CCTTGGAGTA TTCATCCAAT GCTTACATGG TTCAAACCGC TCTTGGAATC 7 50 

ATGGGCCAGA CCTATCAACC AAATATGTTT GTTGGAACCA GCAATTTGGA 800 

AACAGCTATG GGAAAACTTC GTGCGACCTT TGGCGAATAT GGCTTGGGGG 850 

CTGCGACCGG AATTGACCTA CCAGATGAAT CTACTGGATT TGTTCCCAAA 900 

15 GAGTATAGCT TTGCTAATTA CATCACCAAT TCCTTTGGGC AGTTTGATAA 950 

CTATACGCCC ATGCAGTTGG CTCAGTATGT AGCAACTATT GCAAATAATG 1000 

GTGTTCGTGT GGCTCCTCGT ATTGTTGAAG GCATTTATGG TAATAATGAT 1050 

AAGGGAGGAC TGGGTGACTT GATTCAGCAA CTGCAACCGA CAGAGATGAA 1100 

TAAGGTCAAT AT ATC CG ACT CCGATATGAG CATCTTGCAC CAAGGTTTTT 1150 

2 0 ATCAGGTTGC C C ATGGT ACT AGTGGATTGA CAACTGGACG TGCCTTTTCA 1200 

AATGGTGCCT TGGTATCCAT TAGCGGAAAA ACAGGTACAG CCGAAAGCTA 1250 

TGTGGCAGAT GGTCAGCAAG CAACCAATAC CAATGCGGTG GCCTATGCCC 1300 

CATC TG AT AA TCCCCAAATC GCTGTCGCAG TGGTCTTTCC TCATAATACC 1350 

AATCTAACAA ATGGTGTAGG ACCTTCCATT GCGCGTGACA TTATCAATCT 1400 

2 5 GTATCAAAAA TACCATCCAA TGAACTAGAA AGGAAATTAT G 1441 



30 



2 ) INFORMATION FOR SEQ ID NO: 1021 



( i") SEQUENCE' CHARACTERISTIC S : 

(A) LENGTH: 1396 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-03 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1021 

45 TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 

GGGATCGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT AGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAT CCTATCTTAA 150 

AAAAGGCTAT TCTC TAAATG ACCGTGTTGG AACCTCCTAT TTGGAAAAGC 200 

AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 

50 GATAAATATG GCAATATGGA AAGCGTGGAC ACAATTGAGG AAGGTAGTAA 300 

GGGAAACAAT ATCAAACTGA CCATTGATTT GGCCTTCCAA GATAGCGTGG 350 

ATGCTTTGCT GAAAAGTTAT TTCAATTC CG AGC TAGG AAA TGGTGGAGCT 400 

AAATATTCTG AAGGTGTCTA TGCAGTCGCC CTTAACCCAA AAACAGGTGC 450 

TGTTTTATCC ATGTCAGGGA TCAAACATGA CCTGAAAACG GGAGAGTTGA 500 

55 CTCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGTTCGGTT 55 0 

GTCAAGGCTG C G AC CATC AG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 650 

TTTATTCTTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 700 

GAAGCCTTGG AGTATTCATC CAATGCTTAC ATGGTTCAAA CCGCTCTTGG 750 

60 AATCATGGGC CAGACCTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 800 

772 
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TGGAAACAGC 
GGGGCTGCGA 
CAAAGAGTAT 
AT AAC TAT AC 
5 AATGGTGTTC 
TGATAAGGGA 
TGAATAAGGT 
TTTTATCAGG 
TTCAAATGGT 
10 GCTATGTGGC 
GCCCCATCTG 
TACCAATCTA 



TATGGGAAAA 
CCGGAATTGA 
AGCTTTGCTA 
GCCCATGCAG 
GTGTGGCTCC 
GGACTGGGTG 
CAATATATCC 
TTGCCCATGG 
GCCTTGGTAT 
AGATGGTCAG 
ATAATCCCCA 
ACAAATGGTG 



CTTCGTGCGA 
CCTACCAGAT 
ATT AC ATC AC 
TTGGCTCAGT 
TCGTATTGTT 
ACTTGATTCA 
GACTCCGATA 
T AC T AGTGG A 
CCATTAGCGG 
CAAGCAACCA 
AATCGCTGTC 
TAGGACCTTC 



CCTTTGGCGA 
GAATCTACTG 
CAATTCCTTT 
ATGTAGCAAC 
GAAGGCATTT 
GCAACTGCAA 
TGAGCATCTT 
TTGACAACTG 
AAAAACAGGT 
ATACCAATGC 
GCAGTGGTCT 
CATTGCGCGT 



ATATGGCTTG 850 

GATTTGTTCC 900 

GGGCAGTTTG 950 

TATTGCAAAT 1000 

ATGGTAATAA 1050 

CCGACAGAGA 1100 

GCACCAAGGT 1150 

GACGTGCCTT 1200 

AC AGC C G AAA 1250 

GGTGGCCTAT 1300 

TTCCTCATAA 1350 

GACATT 1396 



15 



20 



25 



30 



2 ) INFORMATION FOR SEQ ID NO: 1022 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1428 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-04 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1022 



AAAGGAGATG 
TGGAAACTTC 
GGTCTCCCAG 

3 5 AAATGACCGT 
TACAAGGAAA 
ATGGAAAGCG 
ACTGACCATT 
GTTATTTCAA 

40 GTCTATGCAG 
AGGGATCAAA 
GAACGGTAAC 
ATCAGCTCAG 
AGATCAGCCT 

45 AATTGGCATA 
TCATCCAATG 
CTATCAACCA 
GAAAACTTCG 
ATTGACCTAC 

50 TGCTAATTAC 
TGCAGTTGGC 
GCTCCTCGTA 
GGGTGACTTG 
TATCCGACTC 

55 CATGGTACTA 
GGTATCCATT 
GTCAGCAAGC 
CCCCAAATCG 
TGGTGTAGGA 

60 ACCATCCAAT 



CCTGGCATTA 
CCTTTCTTCT 
CGGAAGAAGC 
GTTGGAACCT 
ACGCTCGGTA 
TGGACACAAT 
GATTTGGCCT 
TTCCGAGCTA 
TCGCCCTTAA 
CATGACCTGA 
CAATGTCTTT 
GTTGGGAAAA 
ATTGTTTTCC 
TGGATCTTTT 
CTTACATGGT 
AATATGTTTG 
TGCGACCTTT 
CAGATGAATC 
ATCACCAATT 
TCAGTATGTA 
TTGTTGAAGG 
ATTCAGCAAC 
CGATATGAGC 
GTGGATTGAC 
AGCGGAAAAA 
AACCAATACC 
CTGTCGCAGT 
CCTTCCATTG 
G AAC TAG AAA 



GTATTTCTAC 
ATAGTAGGGA 
AGAATCCTAT 
CCTATTTGGA 
AAAGAAATCC 
TGAGGAAGGT 
TCCAAGATAG 
GGAAATGGTG 
C C C AAAAAC A 
AAAC GGG AG A 
GTCCCAGGTT 
TGGTGTTTTA 
AAGGTTCAGC 
CCTATTAC AG 
TCAAACCGCT 
TTGGAACCAG 
GGCGAATATG 
TACTGGATTT 
CCTTTGGGCA 
GCAACTATTG 
CATTTATGGT 
TGCAACCGAC 
ATCTTGCACC 
AACTGGACGT 
CAGGTACAGC 
AATGCGGTGG 
GGTCTTTCCT 
CGCGTGACAT 
GGAAATTA 



TTCTTGGGAT 
GTGTATCCAG 
CTTAAAAAAG 
AAAGCAATAT 
ATCTGGATAA 
AGTAAGGGAA 
CGTGGATGCT 
GAGCTAAATA 
GGTGC TGTTT 
GTTGACTCCT 
CGGTTGTCAA 
TCAGGAAACC 
TCCAATTTAT 
CTGTGGAAGC 
CTTGGAATCA 
CAATTTGGAA 
GCTTGGGGGC 
GTTCCCAAAG 
GTTTGATAAC 
CAAATAATGG 
AATAATGATA 
AGAGATGAAT 
AAGGTTTTTA 
GCCTTTTCAA 
CGAAAGCTAT 
CCTATGCCCC 
CATAATACCA 
TATCAATCTG 



CGAAAGGTTT 50 

TGAAAAAGCT 100 

GCTATTCTCT 150 

GAAGAGACCT 2 00 

ATATGGCAAT 2 50 

ACAATATCAA 3 00 

TTGCTGAAAA 350 

TTCTGAAGGT 400 

TATCCATGTC 450 

GATTCCTTGG 500 

GGCTGCGACC 550 

AAACCTTAAC 600 

TCTTGGTATA 650 

CTTGGAGTAT - 7 00 

TGGGCCAGAC 750 

ACAGCTATGG 800 

TGCGACCGGA 850 

AGTATAGCTT 900 

TATACGC C C A 950 

TGTTCGTGTG 1000 

AGGG AGGACT 1050 

AAGGTCAATA 1100 

TCAGGTTGCC 1150 

ATGGTGCCTT 12 00 

GTGGCAGATG 12 50 

ATCTGATAAT 13 00 

ATCTAACAAA 1350 

TATCAAAAAT 1400 
1428 



773 
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2 ) INFORMATION FOR SEQ ID NO : 1023 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1442 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1023 

2 0 TGCCTCTATT TCAAAGGAGA TGCCTGGCAT TAGTATTTCT ACTTCTTGGG 50 
~" ATCGAAAGGT TTTGGAAACT TCCCTTTCTT CTATAGTAGG GAGTGTATCC 100 

AGTGAAAAAG CTGGTCTCCC AGCGGAAGAA GCAGAATCCT ATCTTAAAAA 150 

AGGCTATTCT CTAAATGACC GTGTTGGAAC CTCCTATTTG GAAAAGCAAT 200 

ATGAAGAGAC CTTACAAGGA AAACGCTCGG TAAAAGAAAT CCATCTGGAT 250 

25 AAATATGGCA ATATGGAAAG CGTGGACACA ATTGAGGAAG GTAGTAAGGG 3 00 

AAACAATATC AAACTGACCA TTGATTTGGC CTTCCAAGAT AGCGTGGATG 350 

CTTTGCTGAA AAGTTATTTC AATTCCGAGC TAGGAAATGG TGGAG CT AAA 400 

TATTCTGAAG GTGTCTATGC AGTCGCCCTT AACCCAAAAA CAGGTGCTGT 450 

TTTATCCATG TCAGGGATCA AACATGACCT GAAAACGGGA GAGTTGACTC 500 

3 0 CTGATTCCTT GGGAACGGTA ACCAATGTCT TTGTCCCAGG TTCGGTTGTC 55 0 

AAGGCTGCGA CCATCAGCTC AGGTTGGGAA AATGGTGTTT TATCAGGAAA 600 

CCAAACCTTA ACAGATCAGC CTATTGTTTT C C AAGGTTC A GCTCCAATTT 650 

ATTCTTGGTA TAAATTGGCA TATGGATCTT TTCC TATTAC AGCTGTGGAA 700 

GCCTTGGAGT ATTCATCCAA TGCTTACATG GTTCAAACCG CTCTTGGAAT 7 50 

35 CATGGGCCAG ACCTATCAAC CAAATATGTT TGTTGGAACC AGCAATTTGG 800 

AAACAGCTAT GGGAAAACTT CGTGCGACCT TTGGCGAATA TGGCTTGGGG 85 0 

GCTGCGACCG GAATTGACCT ACCAGATGAA TCTACTGGAT TTGTTCCCAA 9 00 

AGAGTATAGC TTTGC TAATT ACATCACCAA TTCCTTTGGG CAGTTTGATA 950 

AC TATACGCC CATGCAGTTG GCTCAGTATG TAGCAACTAT TGCAAATAAT 1000 

40 GGTGTTCGTG TGGCTCCTCG TATTGTTGAA GGCATTTATG GTAATAATGA 1050 

TAAGGGAGGA CTGGGTG AC T TGATTCAGCA ACTGCAACCG ACAGAGATGA 1100 

ATAAGGTCAA TATATCCGAC TCCGATATGA GCATCTTGCA CCAAGGTTTT 1150 

TATCAGGTTG CCCATGGTAC TAGTGGATTG AC AAC TGGAC GTGCCTTTTC 1200 

AAATGGTGCC TTGGTATCCA TTAGCGGAAA AACAGGTACA GCCGAAAGCT 1250 

45 ATGTGGCAGA TGGTCAGCAA GCAACCAATA CCAATGCGGT GGCCTATGCC 1300 

CCATCTGATA ATCCCCAAAT CGCTGTCGCA GTGGTCTTTC CTCATAATAC 13 50 

CAATCTAACA AATGGTGTAG GACCTTCCAT TGCGCGTGAC ATTATCAATC 1400 

TGTATCAAAA ATACCATCCA ATGAACTAGA AAGGAAATTA TG 1442 



50 



2 ) INFORMATION FOR SEQ ID NO: 1024 



( i ) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 1445 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 (ii) MOLECULE TYPE: Genomic DNA 

774 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1024 

TTGCCTCTAT TTCAAAGGAG ATGCCTGGCA TTAGTATTTC TACTTCTTGG 50 
GATCGAAAGG TTTTGGAAAC TTCCCTTTCT TCTATAGTAG GGAGTGTATC 100 

10 CAGTGAAAAA GCTGGTCTCC CAGCGGAAGA AGCAGAATCC TATCTTAAAA 150 
AAGGC TATTC TCTAAATGAC CGTGTTGGAA CCTCCTATTT GGAAAAGCAA 2 00 

TATGAAGAGA CCTTACAAGG AAAACGCTCG GTAAAAGAAA TCCATCTGGA 2 50 

TAAATATGGC AATATGGAAA GC GTGG AC AC AATTGAGGAA GGTAGTAAGG 300 
GAAACAATAT CAAACTGACC ATTGATTTGG CCTTCCAAGA TAGCGTGGAT 3 50 

15 GCTTTGCTGA AAAGTTATTT CAATTCCGAG CTAGGAAATG GTGGAGCTAA 400 
AT ATTC TGAA GGTGTCTATG CAGTCGCCCT TAACCCAAAA ACAGGTGCTG 450 
TTTTATCCAT GTCAGGGATC AAACATGACC TGAAAACGGG AGAGTTGACT 5 00 

CCTGATTCCT TGGGAACGGT AACCAATGTC TTTGTCCCAG GTTCGGTTGT 550 
CAAGGCTGCG ACCATCAGCT CAGGTTGGGA AAATGGTGTT TTATCAGGAA 600 

20 ACCAAACCTT AACAGATCAG CCTATTGTTT TCCAAGGTTC AGCTCCAATT 650 
TATTCTTGGT ATAAATTGGC ATATGGATCT TTTCCTATTA CAGCTGTGGA 7 00 

AGCCTTGGAG TATTCATCCA ATGCTTACAT GGTTCAAACC GCTCTTGGAA 7 50 

TCATGGGCCA GACCTATCAA CCAAATATGT TTGTTGGAAC CAGCAATTTG 800 
GAAACAGCTA TGGGAAAACT TCGTGCGACC TTTGGCGAAT ATGGCTTGGG 850 

25 GGCTGCGACC GGAATTGACC TACCAGATGA ATCTACTGGA TTTGTTC C C A 900 
AAGAGTATAG CTTTGCTAAT TACATCACCA ATTCCTTTGG GCAGTTTGAT 950 

AACTATACAC CCATGCAGTT GGCTCAGTAT GTAGCAACTA TTGCAAATAA 1000 

TGGTGTTCGT GTGGCTCCTC GTATTGTTGA AGGCATTTAT GGTAATAATG 1050 

ATAAGGGAGG ACTGGGTGAC TTGATTCAGC AACTGCAACC GACAGAGATG 1100 

3 0 AATAAGGTCA ATATATCCGA CTCCGATATG AGCATCTTGC AC C AAGGTTT 1150 

TTATCAGGTT GCCCATGGTA CTAGTGGATT GACAACTGGA CGTGCCTTTT 12 00 

CAAATGGTGC CTTGGTATCC ATTAGCGGAA AAACAGGTAC AGCCGAAAGC 12 5 0 

TATGTGGCAG ATGGTCAGCA AGCAACCAAT ACCAATGCGG TGGCCTATGC 13 00 

CCCATCTGAT AATCCCCAAA TCGCTGTCGC AGTGGTCTTT CCTCATAATA 1350 

3 5 CCAATCTAAC AAATGGTGTA GGACCTTCCA TTGCGCGTGA CATTATCAAT 1400 

C TGT ATC AAA AATACCATCC AATGAACTAG AAAGGAAATT ATGCT 1445 



40 2 ) INFORMATION FOR SEQ ID NO: 1025 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1441 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

( B ) STRAIN : StrR- 0 7 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 102 5 



TTGCCTCTAT TTCAAAGGAG ATGCCTGGCA TTAGTATTTC TACTTCTTGG 50 

GATCGAAAGA TTTTGGAAAC TTCCCTTTCT TCTATAGTAG GGAGTGTATC 100 

CAGTGAAAAA GCTGGTCTCC CAGCGGAAGA AGCAGAATCC TATCTTAAAA 150 

AAGGCTATTC TCTAAATGAC CGTGTTGGAA CCTCGTATTT GGAAAAGCAA 2 00 

6 0 TATGAAGAGA CCTTACAAGG AAAACGCTCG GTAAAAGAAA TCCATCTGGA 250 

775 
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TAAATATGGC AATATGGAAA GCGTGGACAC AATTGAGGAA GGTAGTAAGG 300 

GAAACAATAT CAAACTGACC ATTGATTTGG CCTTCCAAGA TAGCGTGGAT 3 50 

GCTTTGCTGA AAAGTTATTT C AATTC C GAG CTAGGAAATG GTGGAGCCAA 400 

GTATTCTGAG GGTGTGTATG CAGTCGCCCT TAACCCCAAA ACAGGTGCTG 450 

5 TTTTGTCTAT GTCAGGACTC AAACATGACC TGAAAACGGG AG AGTTGAC T 500 

CCTGATTCCT TGGGAACGGT AACCAATGTC TTTGTCCCAG GTTCGGTTGT 550 

CAAGGCTGCG ACCATCAGCT CTGGCTGGGA AAATGGTGTT TTATCAGGAA 600 

ACCAAACCTT AACAGATCAG CCTATTGTTT TCCAAGGTTC AGCTCCAATT 650 

TATTCTTGGT ATAAATTGGC ATATGGATCT TTTCCTATTA CAGCTGTGGA 7 00 

10 AGCCTTGGAG TATTCATCCA ATGC TTAC AT GGTTCAAACC GCTCTTGGAA 750 

TCATGGGCCA GACCTATCAA CCAAATATGT TTGTTGGAAC CAGCAATTTG 800 

GAAACAGCTA TGGGAAAACT TCGTGCGACC TTTGGC GAAT ATGGCTTGGG 850 

GGCTGCGACC GGAATTGACC TACCAGATGA ATCTACTGGA TTTGTTCCCA 900 
AAGACTATAG CTTTGCTAAT TACATCACCA ATGCCTTTGG GCAGTTTGAT 950 

15 AACTATACGC CCATGCAGTT GGCTCAGTAT GTAGCAACTA TTGCAAATGA 1000 

TGGTGTTCGT GTGGCTCCTC GTATTGTTGA AGGCATTTAT GGTAATAATG 1050 

ATAAGGGAGG ACTGGGTGAC TTGATTCAGC AACTGCAACC GACAGAGATG 1100 

AATAAGGTCA ATATATCCGA CTCCGATATG AGTATCTTGC ACCAAGGATT 1150 

TTACCAAGTA TCGCATGGAA CTAGTCCCCT TACGACAGGA CGGGCGTTTT 12 00 

2 0 CAGATGGCGC CACTGTTTCT ATCAGTGGTA AGACCGGTAC AGGTGAAAGC 12 50 

TATGTAGCTG GTGGTCAAGA AGCTAATAAT ACCAATGCCG TGGCCTATGC 13 00 

TCCAACAGAA AATC C TC AAA TTGCAGTTGC AGTAGTCTTT CCTCATAATA 1350 

CCAATTTAAC CAAAAATGTT GGGCCAGCAA TTGCTCGCGA CATTATCAAT 1400 

TTATATAACC AACACCATCC AATGAATTAG AAAGGAAGCC A 1441 



25 



35 



2 ) INFORMATION FOR SEQ ID NO: 1026 



3 0 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1443 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
40 (B) STRAIN: StrR-08 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102 6 

TATTGC CTCT ATTTCAAAGG AG ATGC CTGG CATTAGTATT TCTACTTCTT 50 

45 GGGATCGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT AGGGAGTGTA 100 

TC C AGTG AAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAT CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG AACCTCCTAT TTGGAAAAGC 200 

AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 

GATAAATATG GCAATATGGA AAGCGTGGAC ACAATTGAGG AAGGTAGTAA 3 00 

50 GGGAAACAAT ATCAAACTGA CCATTGATTT GGCCTTCCAA GATAGCGTGG 35 0 

ATGCTTTGCT GAAAAGTTAT TTCAATTCCG AGCTAGGAAA TGGTGGAGCT 400 

AAATATTCTG AAGGTGTCTA TGCAGTCGCC CTTAACCCAA AAACAGGTGC 450 

TGTTTTATCC ATGTCAGGGA TCAAACATGA CCTGAAAACG GGAGAGTTGA 500 

CTCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGTTCGGTT 550 

55 GTCAAGGCTG CGACCATCAG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 65 0 

TTTATTCTTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 700 

GAAGCCTTGG AGTATTCATC CAATGCTTAC ATGGTTCAAA CCGCTCTTGG 75 0 

AATCATGGGC CAGACCTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 800 

60 TGGAAACAGC TATGGGAAAA CTTCGTGCGA CCTTTGGCGA ATATGGC TTG 850 

776 
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GGGGCTGCGA CCGGAATTGA CCTACCAGAT 
CAAAGAGTAT AGCTTTGCTA ATTACATCAC 
AT AAC TAT AC GCCCATGCAG TTGGCTCAGT 
AATGGTGTTC GTGTGGCTCC TCGTATTGTT 
5 TGATAAGGGA GGACTGGGTG ACTTGATTCA 
TGAATAAGGT CAATATATCC GACTCCGATA 
TTTTATCAGG TTGCCCATGG TACTAGTGGA 
TTCAAATGGT GCCTTGGTAT CCATTAGCGG 
GCTATGTGGC AGATGGTCAG CAAGCAACCA 
10 GCCCCATCTG ATAATCCCCA AATCGCTGTC 
TACCAATCTA ACAAATGGTG TAGGACCTTC 
ATCTGTATCA AAAAT AC CAT CCAATGAACT 



bAA 1L1 AL 1 vj 


ci a ttthttp p 


900 


tAAI ILL 111 




950 


7\ rpPTi^fP A AO 
AI Vj 1 AAjV^Afi^ 


TATTY^P A A AT 


1000 


bAAvjoLAl 1 1 


ATfif^T A ATA A 


1050 


a a PTrr* a a 


PP^APAPtAfSA 


1100 


mo a A fpOTT 
1 VjAVjL. A iLl 1 


a pp A APf^T 


1150 


1 1 bALrt/iL 1 w 


fSAPOTRPPTT 


1200 


AAAAACAGGT 


ACAGCCGAAA 


1250 


ATACCAATGC 


GGTGGCCTAT 


1300 


GCAGTGGTCT 


TTCCTCATAA 


1350 


CATTGCGCGT 


GACATTATCA 


1400 


AGAAAGGAAA 


TTA 


1443 



15 



2 ) INFORMATION FOR SEQ ID NO: 1027 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1443 bases 
2 0 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-09 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1027 

TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 

GGGATAGAAA GGTTTTGGAA ACTTCCCTTT CTTC TATAGT TGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAG CCTATCTTAA 15 0 

35 AAAAGGCTAT TCTCTAAATG ATCGTGTAGG AACCTCCTAT TTGGAAAAGC 2 00 

AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 

GATAAATATG GCAATATGGA AAGCGTGGAT ACAATTGAGG AAGGTAGTAA 300 

GGGAAACAAT ATCAAACTGA CCATTGATTT GGCTTTCCAA GATAGCGTGG 350 

ATGCTTTACT GAAAAGTTAT TTCAATTCCG AGCTAGAAAA TGGTGGAGCC 400 

40 AAGTATTCTG AAGGTGTCTA TGCAGTCGCC CTTAACCCAA AAACAGGTGC 4 50 

TGTTTTGTCT ATGTCAGGGA TTAAACATGA CTTGAAAACG GGAGATTTAA 500 

CACCTGATTC CTTGGGAACA GTAACCAATG TCTTTGTCCC GGGTTCTGTT 550 

GTCAAGGCGG CGACCATCAG CTCTGGTTGG GAGAATGGAG TCTTATCAGG 600 

AAATCAGACC TTG AC AG AC C AACCGATTGT CTTCCAAGGT TCAGCTCCGA 650 

45 TTAATTCTTG GTACACTCAG GCTTACGATT CATTTCCGAT TACAGCGGTG 7 00 

GAAGCCTTGG AGTATTCTTC TAATGCCTAT ATGGTCCAAA CAGCTCTAGG 750 

TCTTATGGGG CAGACCTACC AACCCAATAT GTTTGTCGGC ACCAGCAATC 800 

TAGAGTCTGC TATGGGGAAA TTGCGTTCAA CCTTTGGTGA ATATGGCTTG 850 

GGCTCTGCGA CTGGGATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

50 CAAAGAGTAT AGCTTTGCTA ATTACATTAC TAATGCCTTT GGGCAGTTTG 950 

AT AAC TAT AC GCCGATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

GATGGTGTTC GTGTGGCTCC TCGTATTGTG GAAGGCATTT ATGGCAATAA 1050 

TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGTATCTT GCACCAAGGT 1150 

55 TTTTATCAGG TTGCTCATGG GACTAGCGGA TTG AC AAC AG GTCGTGCCTT 1200 

TTCCAATGGT GCAGCTGTAT CCATTAGTGG AAAAACAGGT ACCGCCGAAA 12 50 

GTTATGTAGC AGGTGGCCAA GAAGCCAACA ATACTAATGC TGTAGCCTAT 13 00 

GCACCATCAG ATAATCCTCA AATAGCTGTT GCTGTTGTCT TCCCTCATAA 1350 

CACCAATCTA ACAAATGGTG TAGGACCTTC CATTGCGCGT GATATTATCA 1400 

60 ATCTGTATCA AAAAT ACC AT CCAATGAACT AGAAAGGAAT TTA 1443 

777 
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2 ) INFORMATION FOR SEQ ID NO: 1028 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1446 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-10 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1028 

2 0 TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACCTCTT 50 

GGGATAGAAA GGTTTTGGAA ACTTCTCTTT CTTCTATAGT AGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAG CCTATCTTAA 150 

AAAAGGC TAT TCTCTAAATG ACCGTGTTGG GACTTCTTAC CTGGAAAAAC 2 00 

AATACGAGGA AACCTTACAA GGAAAACGCT CGGTAAAAGA AATC CATC TG 250 

25 GATAAATATG GCAATATGGA AAGCGTGGAT ACAATTGAGG AAGGTAGTAA 300 

GGGAAACAAT ATCAAACTGA CCATTGATTT GGCCTTCCAA GATAGCGTGG 350 

ATGCTTTGCT GAAAAGTTAT TTCAATTCCG AGC TAGGAAA TGGTGGAGCT 400 

AAGTATTCTG AAGGCGTGTA TGCAGTCGCC CTTAACCCCA AAACAGGTGC 450 

TGTTTTGTCT ATGTCAGGAC TCAAACATGA CCTGAAAACG GGAGACTTGA 500 

3 0 CGCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGGTCAGTA 550 

"GTTAAGGCCG CTACCATCAG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 650 

TTTATTC TTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 7 00 

GAAGCCTTGG AGTATTCATC TAATGCTTAC ATGGTTCAAA CCGCTCTTGG 750 

35 AATCATGGGC C AG AC CTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 800 

TGGAAACAGC TATGGGAAAA CTTCGTGCGA CCTTTGGCGA ATATGGCTTG 850 

GGGGCTGCGA CCGGAATTGA CCTACCAGAT G AATC TACTG GATTTGTTCC 900 

CAAAGAGTAT AGCTTTGCTA ATTACATTAC TAATGCCTTT GGGCAGTTTG 950 

AT AAC TAT AC GCCGATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

40 GATGGTGTTC GTGTGGCTCC TCGTATTGTT GAAGGCATTT ATGGTAATAA 1050 

TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTC CG AT A TGAGCATCTT GCACCAAGGT 1150 

TTTTATCAGG TTGCCCATGG TACTAGTGGA TTG AC AAC TG GACGTGCCTT 1200 

TTCAAATGGC GCC TTGGTAT CCATTAGCGG AAAAACAGGT ACAGCCGAAA 1250 

45 GCTATGTGGC AGATGGTCAG CAAGCAACCA ATACCAATGC GGTGGC C TAT 13 00 

GCCCCATCTG ATAATCCCCA AATCGCTGTT GCAGTGGTCT TTCCTCATAA 13 50 

TACCAATCTA ACAAATGGTG TAGGACCTTC CATTGCGCGT GACATTATCA 1400 

ATCTGTATCA AAAATACCAT CCAATGAACT AGAAAGGAAA TTATGC 1446 



50 



2 ) INFORMATION FOR SEQ ID NO: 1029 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 1423 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



60 



(ii) MOLECULE TYPE: Genomic DNA 

778 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1029 



TTGCCTCTAT 

GATAGAAAGG 

10 CAGTGAAAAA 

AAGGCTATTC 

TATGAAGAGA 

TAAATATGGC 

GAAACAATAT 

15 GCTTTACTGA 
GTATTCTGAA 

TTTTGTCTAT 
CCTGATTCCT 
CAAGGCGGCG 
2 0 ACC AGACCTT 
AATTCTTGGT 
AGCTCTGGAG 
TTATGGGGCA 
GAGTCTGCTA 

2 5 GGCTGCGACT 

AAGACTATAA 
AACTATACCC 
TGGTGTTCGT 
ATAAGGGAGG 

3 0 AATAAGGTC A 

TTATCAGGTT 
CAAATGGTGC 
TATGTGGCAG 
CCCATCTGAT 
3 5 CCAATCTAAC 
CTGTATCAAA 



TTCAAAGGAG 

TTTTGGAAAC 

GCTGGTCTCC 

TC T AAATG AC 

CCTTACAAGG 

AATATGGAAA 

CAAACTG AC C 

AAAGTTATTT 

GGTGTCTATG 

GTCAGGGATT 

TGGGAACGGT 

ACCATCAGCT 

GACAGACCAG 

AT AC TC AGGC 

TATTCATCCA 

GACCTATCAA 

TGGGTAAATT 

GGGATTGATC 

CTTTGCCAAT 

CAATGCAATT 

GTGGCTCCTC 

AC TGGGTG AC 

ATATATCCGA 

GCTCATGGGA 

CTTGGTATCC 

ATGGTCAGGA 

AATCCCCAAA 

AAATGGTGTA 

AATACCATCC 



ATGCCTGGCA 

TTCCCTTTCT 

CAGCGGAAGA 

CGTGTAGGAA 

AAAACGCTCG 

GCGTGGATAC 

ATTGATTTGG 

CAATTCTGAG 

CAGTCGCCCT 

AAACATGACT 

AACCAATGTC 

CAGGTTGGGA 

TCCATTGTCT 

TTACGGTTCA 

ATGCTTATAT 

CCCAATATGT 

GCGTTCAACC 

TACCAGATGA 

TTCATTACCA 

GGCTCAGTAT 

GTATTGTTGA 

TTGATTCAGC 

CTCCGATATG 

CTAGTGGGTT 

ATTAGCGGAA 

AGCAACCAAT 

TCGCTGTCGC 

GGACCTTCCA 

AAT 



TTAGTATTTC 

TC TATAGTTG 

AGCAGAAGCC 

CCTCCTATTT 

GTAAAAGAAA 

AATTGAGGAA 

CCTTCCAAGA 

C T AG AAAATG 

TAACCCAAAA 

TGAAAACGGG 

TTTGTTCCAG 

AAATGGAGTC 

TCCAAGGTTC 

TTCCCTATCA 

GGTCCAAACA 

TTGTCGGCAC 

TTTGGCGAAT 

ATCTACTGGA 

ATGCCTTTGG 

GTAGCAACTA 

AGGCATTTAT 

AACTGCAACC 

AGTGTCTTGC 

GACAACTGGC 

AAACGGGTAC 

ACCAATGCGG 

AGTGGTCTTT 

TTGCGCGTGA 



TACTTCTTGG 50 

GGAGTGTATC 100 

TATCTTAAAA 150 

GGAAAAGCAA 200 

TCCATCTGGA 250 

GGTAGTAAGG 300 

TAGCGTGGAT 350 

GTGGAGCCAA 400 

ACAGGTGCGG 450 

AGAGTTGACG 500- 

GTTCGGTTGT 550 

TTGTCAGGAA 600 

AGCTCCCATC 650 

CAGCGGTCCA 7 00 

GCCTTAGGTC 7 50 

CAGCAATCTA 800 

ATGGCTTGGG 8 50 

TTTGTTCCCA 900 

GCAGTTTGAT 950 

TTGCAAATGA 1000 

GGTAATAATG 1050 

GACAGAGATG 1100 

AC C AAGGTTT 1150 

CGTGCCTTTT 12 00 

AGCCGAAAGC 12 5 0 

TGGCCTATGC 13 00 

CCTC ATAATA 13 5 0 

CATTATCAAT 1400 
1423 



40 2 ) INFORMATION FOR SEQ ID NO : 103 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1447 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-12 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 0 



TATTGCCTCT ATTTCAAAGG AGATGC CTGG CATTAGTATT TCTACTTCTT 50 

GGGATCGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT AGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAT CCTATCTTAA 150 

AAAAGGC TAT TCTCTAAATG ACCGTGTTGG AACCTCCTAT TTGGAAAAGC 2 00 

60 AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 
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GATAAATATG 
GGGAAACAAT 
ATGCTTTGCT 
AAATATTCTG 
5 TGTTTTATCC 
CTCCTGATTC 
GTCAAGGCTG 
AAACCAAACC 
TTTATTCTTG 
10 GAAGCCTTGG 
AATCATGGGC 
TGGAAACAGC 
GGGGCTGCGA 
CAAAGAGTAT 
15 AT AAC TAT AC 
AATGGTGTTC 
TGATAAGGGA 
TGAATAAGGT 
TTTTATCAGG 
2 0 TTCAAATGGT 
GCTATGTGGC 
GCCCCATCTG 
TACCAATCTA 
ATC TGTATC A 

25 



GCAATATGGA 

ATCAAACTGA 

GAAAAGTTAT 

AAGGTGTCTA 

ATGTCAGGGA 

CTTGGGAACG 

CGACCATCAG 

TTAACAGATC 

GTATAAATTG 

AGTATTCATC 

CAGACCTATC 

TATGGGAAAA 

CCGGAATTGA 

AGCTTTGCTA 

GCCCATGCAG 

GTGTGGCTCC 

GGACTGGGTG 

CAATATATCC 

TTGCCCATGG 

GCCTTGGTAT 

AGATGGTCAG 

ATAATCCCCA 

ACAAATGGTG 

AAAATACCAT 



AAGC GTGG AC 

CCATTGATTT 

TTCAATTCCG 

TGCAGTCGCC 

TCAAACATGA 

GTAACCAATG 

CTCAGGTTGG 

AGC CTATTGT 

GCATATGGAT 

C AATGC TT AC 

AACCAAATAT 

CTTCGTGCGA 

CCTAC C AG AT 

ATTACATCAC 

TTGGCTCAGT 

TCGTATTGTT 

ACTTGATTCA 

GACTCCGATA 

TACTAGTGGA 

CCATTAGCGG 

CAAGCAACCA 

AATCGCTGTC 

TAGGACCTTC 

CCAATGAACT 



ACAATTGAGG 

GGCCTTCCAA 

AGCTAGGAAA 

CTTAACCCAA 

CCTGAAAACG 

TCTTTGTCCC 

GAAAATGGTG 

TTTCCAAGGT 

CTTTTCCTAT 

ATGGTTCAAA 

GTTTGTTGGA 

CCTTTGGCGA 

GAATCTACTG 

CAATTCCTTT 

ATGTAGCAAC 

GAAGGCATTT 

GCAACTGCAA 

TG AGC ATC TT 

TTGACAACTG 

AAAAACAGGT 

ATACCAATGC 

GCAGTGGTCT 

CATTGCGCGT 

AGAAAGGAAA 



AAGGTAGTAA 3 00 

GAT AGC GTGG 3 50 

TGGTGGAGCT 400 

AAACAGGTGC 450 

GGAGAGTTGA 500 

AGGTTCGGTT 550 

TTTTATCAGG 600 

TCAGCTCCAA 650 

TACAGCTGTG 7 00 

CCGCTCTTGG 7 50 

ACCAGCAATT 800 

ATATGGCTTG 850 
GATTTGTTCC 900 
GGGCAGTTTG 950 

TATTGCAAAT 1000 

ATGGT AATAA 1050 

CCGACAGAGA 1100 

GCACCAAGGT 1150 

GACGTGCCTT 1200 

ACAGCCGAAA 12 50 

GGTGGCCTAT 1300 

TTCCTCATAA 1350 

GAC ATTATC A 1400 

TTATGCT 1447 



2 ) INFORMATION FOR SEQ ID NO: 1031 



30 



35 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1446 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-13 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1031 



ATTGCCTCTA 

45 GGATAGAAAG 
CCAGTGAAAA 
AAAGGCTATT 
ATACGAGGAA 
ATAAATATGG 

50 GGAAACAATA 
TGCTTTGCTG 
AGTATTCTGA 
GTTTTGTCTA 
GCCTGATTCC 

55 TTAAGGCCGC 
AACCAAACCT 
TTATTCTTGG 
AAGCCTTGGA 
ATCATGGGCC 

60 GGAAACAGCT 



TTTCAAAGGA 
GTTTTGGAAA 
AGCTGGTCTC 
C TCTAAATG A 
ACCTTACAAG 
CAATATGGAA 
TCAAACTGAC 
AAAAGTTATT 
AGGCGTGTAT 
TGTCAGGACT 
TTGGGAACGG 
TACCATCAGC 
TAACAGATCA 
TATAAATTGG 
GTATTCATCT 
AGACCTATCA 
ATGGGAAAAC 



GATGCCTGGC 
CTTCTCTTTC 
CCAGCGGAAG 
CCGTGTTGGG 
GAAAACGCTC 
AGCGTGGATA 
CATTGATTTG 
TCAATTCCGA 
GCAGTCGCCC 
CAAACATGAC 
TAACCAATGT 
TCAGGTTGGG 
GCCTATTGTT 
CATATGGATC 
AATGCTTACA 
ACCAAATATG 
TTCGTGCGAC 



ATTAGTATTT 
TTCTATAGTA 
AAGCAGAAGC 
ACTTCTTACC 
GGTAAAAGAA 
CAATTGAGGA 
GCCTTCCAAG 
GCTAGGAAAT 
TTAACCCCAA 
CTGAAAACGG 
CTTTGTCCCA 
AAAATGGTGT 
TTCCAAGGTT 
TTTTCCTATT 
TGGTTCAAAC 
TTTGTTGGAA 
CTTTGGCGAA 



CTACCTCTTG 50 

GGGAGTGTAT 100 

CTATCTTAAA 150 

TGG AAAAAC A 200 

ATCCATCTGG 250 

AGGTAGTAAG 300 

ATAGCGTGGA 350 

GGTGGAGCTA 400 

AACAGGTGCT 450 

GAGACTTGAC 500 

GGGTCAGTAG 550 

TTTATC AGGA 600 

CAGCTCCAAT 650 

ACAGCTGTGG 7 00 

CGCTCTTGGA 750 

CCAGCAATTT 800 

TATGGC TTGG 850 
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GGGCTGCGAC CGGAATTGAC 
AAAGAGTATA GCTTTGCTAA 
TAACT AT AC G CCGATGCAGT 
ATGGTGTTCG TGTGGCTCCT 
5 GATAAGGGAG GACTGGGTGA 
GAATAAGGTC AATATATCCG 
TTTATCAGGT TGCCCATGGT 
TCAAATGGCG CCTTGGTATC 
CTATGTGGCA GATGGTCAGC 
10 CCCCATCTGA TAATCCCCAA 
ACCAATCTAA CAAATGGTGT 
TCTGTATCAA AAATACCATC 



CTACCAGATG AATCTACTGG 
TTACATTACT AATGCCTTTG 
TGGCTCAGTA TGTAGCAACT 
CGTATTGTTG AAGGCATTTA 
CTTGATTCAG CAACTGCAAC 
ACTCCGATAT GAGCATCTTG 
ACTAGTGGAT TGACAACTGG 
CATTAGCGGA AAAACAGGTA 
AAGCAACCAA TACCAATGCG 
ATCGCTGTTG CAGTGGTCTT 
AGGACCTTCC ATTGCGCGTG 
CAATGAACTA GAAAGGAAAT 



ATTTGTTCCC 900 

GGCAGTTTGA 95 0 

ATTGC AAATG 1000 

TGGTAATAAT 1050 

CGACAGAGAT 1100 

C ACCAAGGTT 1150 

ACGTGCCTTT 1200 

CAGCCGAAAG 1250 

GTGGCCTATG 13 00 

TCCTCATAAT 13 50 

ACATTATCAA 1400 

TATGCT 1446 



15 



20 



25 



2 ) INFORMATION FOR SEQ ID NO: 1032 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1446 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-14 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1032 

ATTGCCTCTA TTTCAAAGGA GATGCCTGGC ATTAGTATTT CTACTTCTTG 50 

SgmSESq GTTTTGGAAA CTTCCCTTTC TTCTATAGTT GGGAGTGTAT 100 

SSora^ AGCTGGTCTC CCAGCGGAAG AAGCAGAAGC CTATCTTAAA 150 

^5 Sggct£tt CTCTAAATGA TCGTGTTGGA ACCTCCTATT TGGAAAAGCA 200 

^tSaagag ACCTTACAAG GGAAACGCTC GGTAAAAGAA ATCCATCTGG 250 

a?^at^tgS CAATATGGAA AGTGTGGATA CAATTGAGGA aggtagtaag 300 

GGAAACAATA TCAAGCTGAC CATTGATTTG GCCTTCCAAG AT AGC GTGG A 350 

?got55£tc AAAAGTTATT TCAATTCCGA GCTAGGAAAT GGTGGAGCCA 400 

40 Stattctga GGGTGTGTAT GCAGTCGCCC TTAACCCCAA AACAGGTGCT 450 

GTTTTGTCTA TGTCAGGACT CAAACATGAC CTGAAAACGG GAGAGTTGAC 500 

??CTGATTCC TTGGGAACGG TAACCAATGT CTTTGTCC C A GGTTCGGTTG 550 

tcSgGCTGC GACCATCAGC TCTGGCTGGG AAAATGGAGT CTTATCAGGA 600 

SccaSacct TGACAGACCA GTCCATTGTC TTTCAAGGTT CAGCTCCCAT 650 

45 5S5tc™g TATACTCAGG CTTACGGTTC ATTCCCTATC ACAGCAGTCC 700 

AAGCTCTGGA GTATTCATCT AATGCCTATA TGGTCCAAAC AGCTTTAGGT 750 

5??atcgSgc aga?ctacca ACCTAATATG TTTGTCGGCA CCAGCAACCT 800 

AGAGTCTGCT ATGGGGAAAT TGCGTTCAAC CTTTGGTGAA TATGGTTTGG 850 

StTCTGCGAC CGGGATTGAC CTACCAGATG AATCTACTGG ATTTGTTCCC 900 

50 AAAGACTATA GCTTTGCTAA TTACATCACC AATGCCTTTG GGCAGTTTGA 950 

TAACTATACG CCGATGCAGT TGGCTCAGTA TGTAGCAACT ATTGCAAATG 1000 

ATGGTGTTCG TGTGGCTCCT CGTATTGTTG AAGGCATTTA TGGAAATAAT 1050 

GATAAGGGAG GCCTAGGCGA CTTGATTCAG CAACTGCAAC CGACAGAGAT 1100 

G^^GGTC AATATATCCG ACTCTGATAT GAGTATTTTG CACCAAGGTT 1150 

55 TTTATCAGGT TGCTCATGGG ACTAGTGGAT TGACAACTGG ACGTGCCTTT 1200 

TCAAATGGCG CAGCGGTATC CATTAGTGGA AAAACAGGTA CTGCCGAAAG 1250 

TTATGTTGAG GGTGGTCAAG AAGCTAACAA TACTAATGCT GTGGCCTATG 1300 

CACCATCAGA TAATCCTCAA ATCGCTGTAG CTGTTGTCTT CCCTCATAAC 1350 

ScSStTA CAAATGGTGT CGGACCTTCC ATTGCGCGCG ATATTATCAA 1400 

60 CCTCTATAAC CAACATCATC CAATGAATTA GAAAGGAACA TATGCT 1446 
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2 ) INFORMATION FOR SEQ ID NO: 1033 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1446 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



50 
100 



250 
300 
350 
400 
450 



(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1033 

on TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 
ggga?agaaa ggttttggaa ACTTCCCTTT CTTCTATAGT TGGGAGTGTA 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAG CCTATCTTAA 150 

SSSfiGCTAT TCTC T AAATG ACCGTGTTGG AACCTCGTAT TTGGAAAAGC 2 00 
a^tSSga GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 

25 G^TAAATATG GCAATATGGA AAGCGTGGAT acaattgagg aaggtagtaa 
ggSaaacaat atcaagctga ccattgattt ggccttccaa gatagcgtgg 
atgctttgct gaaaagttat ttcaattccg agctaggaaa tggtggagcc 
aStattctg agggtgtgta tgcagtcgcc cttaacccca aaacaggtgc 

TCTTTTCTCT ATGTCAGGAC TCAAACATGA CCTGAAAACG GGAGAGTTGA 500 

30 CTCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGTTCGGTT 550 

3 §T?SScCG CTACCATCAG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 650 

TTTATTCTTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 7 00 

rlirrrTTGG AGTATTCATC CAATGCTTAC ATGGTTCAAA CCGCTCTTGG 750 

35 SSSmQGC CAGACCTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 800 

tcgaSIcagc tatgggaaaa CTTCGTGCGA CCTTTGGCGA ATATGGCTTG 850 

gISgctgcga CCGGAATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

Saagagta? agctttgcta attacatcac caatgccttt GGGCAGTTTG 950 

ATAACTATAC GCCCATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

40 ^GTO?TC GTGTGGCTCC TCGTATTGTT GAAGGCATTT ATGGTAATAA 1050 

TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGTATCTT GCACCAAGGT 1150 

^ATCAGG ttgctcatgg GACTAGTGGA TTGACAACTG GACGTGCTTT 1200 

TTCAAATGGT GCCTTGGTAT CCATTAGTGG GAAAACAGGT ACTGCCGAAA 1250 

45 gSa^TTGC AGGTGGTCAA GAAGCCAACA ATACCAACGC GGTGGCCTAT 1300 

GCCCCATCAG ATAATCCTCA AATCGC TGTT GCCGTTGTCT TCCCTCATAA 1350 

ScSStcX ACAAATGGTG TTGGACCTTC TATTGCACGC GATATTATCA 1400 

ACCTCTATAA CCAACACCAT CCAATGAATT AGAAAGGAAC TTATGC 1446 



2 ) INFORMATION FOR SEQ ID NO : 1034 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 1670 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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50 
100 
150 
200 
250 
300 
350 
400 
450 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-01 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1034 

GGAAACTGCA GAGGTCAAGG GGATTGATTT TACAACCAGT CCCAATCGTA 
GTTATCCAAA CGGACAATTT GCTTCTAGTT TTATCGGACT AGCTCAGCTC 

io Stcaaaatg AAGATGGCAG CAAGAGCTTA CTGGGAACCT CTGGAATGGA 150 

10 rAPTTCCTTG AACAGTATTC TTGCAGGGAC AG AC GGT ATT ATTACCTATG 200 
SSSScCG TGTAGGAAAT ATCGTACCAG GTACAGAACT GGTATCGCAA 
CAAACTGTGG ATGGCAAGGA TGTTT ATAC A ACATTGTCTA GTCCGCTACA 

ATCTTTCATG GAAACTCAGA TGGATGCCTT TCTAGAAAAA GTAAAAGGTA 350 

15 AGTATATGAC CGCGACCTTG GTCAGTGCAA AGACCGGTGA AATCCTCGCT 400 

aSaCCCAAC GACCTACCTT TAATGCAGAT ACTAAAGAAG GAATCACTGA 450 

GGACTTTGTT TGGCGTGATA TTCTTTATCA AAGTAACTAT GAACCAGGAT 500 

CAGCCATGAA GGTTATGACG TTAGCTTCTT CTATTGATAA TAATACCTTC 550 

cSaaStggag AATACTTCAA TAGCAGTGAA TTCAAAATAG CGGATGCGAC 600 

20 GACTCGAGAT TGGGATGTTA ATGATGGTTT GACTACTGGT GGGATGATGA 650 

CTTTCTTACA AGGTTTCGCT CACTCCAGTA ATGTTGGAAT GAGTCTACTT 700 

gaaJaaaaaS tgSgagatgc TACTTGGTTG GATTATCTAA AACGCTTTAA 750 

at^tg^Sg^ cEaactcgct ttggcttgac agatgaatac gctggtcaac 800 

TTCCAGCTGA TAATATTGTT AGTATTGCTC AAAGCTCATT TGGGCAAGGA 850 

25 ISSa CACAAACACA AATGCTTCGT GCCTTTACAG CTATTGCTAA 900 

TGATGGAGTT ATGCTGGAGC CAAAATTTAT AAGTGCTATT TATGATACTA 950 

Stcagtc TGTACGTAAG TCACAAAAAG AAATAGTAGG AAATCCTGTT 1000 

tcSaaagagg CAGCAAGCAC AACTCGAAAT CACATGATCT TAGTTGGGAC 1050 

GGACCCTC?A TATGGAACTA TGTATAATCA CTACACAGGA AAGC C AATTA 1100 

30 TAaSgTTCC TGGACAAAAT GTAGCAGTTA AATCCGGTAC GGCTCAAATC 1150 

GCTGATGAGA AAAATGGAGG ATACTTGGTT GGTTCTACCA ATTATATTTT 1200 

CTCAGTTGTG ACTATGAATC CTGCTGAAAA TCCTGATTTT ATCTTOTATG 1250 

TAACGGTTCA ACAGCCTGAG CATTATTCAG GTATCCAGTT GGGAGAATTT 1300 

GCCACCCCAA TCTTGGAGCG GGCTTCAGCT ATGAAAGAAT CTCTCAATCT 1350 

35 TcStCTCCA GCCAAAAATT TAGATAAAGT TACGACAGAA TCTTCTTATG 1400 

CAATGCCTAG CATCAAGGAT ATTTCACCTG GTGAGTTGGC GGAAGCCTTA. 1450 

Sgccgaaata TTGTGCAACC AATCGTTGTA GGTACTGGAA CAAAGATTAA 1500 

AGAGACTTCT GTAGAAGAAG GGACCAATCT TGCACCAAAC CAACAAGTTC 1550 

TCCTTTTATC GGATAAGGTA GAAGAAATTC CAGACATGTA TGGCTGGAAA 1600 

40 AAAGaScTC cSaGACCTT TGCTAAATGG TTGGATATTG AACTGGAATT 1650 
TGAAGGTTCA GGTTCCGTTG 



45 2 ) INFORMATION FOR SEQ ID NO: 1035 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1683 bases 

(B) TYPE: Nucleic acid 
5 0 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-02 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1035 
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AAAGAATTGG 
CAATCGTAGT 
CTCAGCTCCA 
GGAATGGAGA 
5 TACCTATGAA 
TTTCCCAACA 
CCCCTCCAGT 
AAAAGGTAAG 
TCCTCGCTAC 
10 ATCACTGAGG 
ACCAGGATCA 
ATACCTTCCC 
GATGCGACGA 
GATGATGACT 
15 GTCTACTTGA 
CGCTTTAAAT 
TGGTCAACTT 
GGCAAGGAAT 
ATTGCTAATG 
2 0 TGATACTAAC 
ATC CTGTTTC 
GTTGGGACGG 
GCCAATTATA 
CTCAAATCGC 

2 5 TATATTTTCT 

CTTGTATGTA 
GAGAATTTGC 
CTCAATCTTC 
TTC TTATGC A 

3 0 AAGC CTTACG 

AAGATTAAAG 
ACAAGTTCTC 
GC TGG AAAAA 
CTGGAATTTG 
35 



AAACTGCAGA 
TACCCAAATG 
TGAAAATGAA 
GTTCCTTGAA 
AAGGATCGTC 
AACGGTGGAT 
CCTTCATGGA 
TATATGACCG 
CACCCAACGA 
AC TTTGTTTG 
GCCATGAAGG 
AAGTGGAGAA 
CTCGAGATTG 
TTCTTACAAG 
ACAAAAAATG 
TTGGGGTTCC 
CCAGCTGATA 
TTCAGTGACA 
ATGGAGTTAT 
AATCAGTCTG 
CAAAGAGGCA 
ACCCTCTATA 
ACAGTTCCTG 
TGATGAGAAA 
CAGTTGTGAC 
ACGGTTCAAC 
CACCCCAATC 
AATCTCCAGC 
ATGCCTAGCA 
CCGAAATATT 
AGACTTCTGT 
CTTTTATCGG 
AGAGACTGCC 
AAGGTTCAGG 



GGTCAAGGGG 
GACAATTTGC 
GATGGAAGCA 
CAGTATTCTT 
TGGGCAATAT 
GGCAAGGATG 
AACTCAGATG 
CGACCTTGGT 
CCTACCTTTA 
GCGTGATATT 
TTATGACGTT 
TACTTCAATA 
GGATGTTAAT 
GTTTCGCTCA 
GGAGATGCTA 
AACTCGCTTT 
ATATTGTTAG 
CAAACACAAA 
GCTGGAGCCA 
TACGTAAGTC 
GCAAGCACAA 
TGGAACTATG 
GACAAAATGT 
AATGGAGGAT 
TATGAATCCT 
AGCCTGAGCA 
TTGGAGCGGG 
CAAAAATTTA 
TCAAGGATAT 
GTGCAACCAA 
AGAAGAAGGG 
ATAAGGTAGA 
GAGACCTTTG 
TTCCGTTGTT 



ATTGATTTTA 
TTCTAGTTTT 
AGAGTTTGCT 
GCAGGGACAG 
TGTACCCGGA 
TTTACACAAC 
GATGCCTTTC 
CAGTGCAAAG 
ATGCAGATAC 
CTTTATCAAA 
AGCTTCTTCT 
GCAGTGAATT 
GATGGTTTGA 
CTCCAGTAAT 
CTTGGTTGGA 
GGCTTGACAG 
TATTGCTCAA 
TGCTTCGTGC 
AAATTTATAA 
ACAAAAAGAA 
CTCGAAATCA 
TATAATCACT 
AGCAGTTAAA 
ACTTGGTTGG 
GCTGAAAATC 
TTATTCAGGT 
CTTCAGCTAT 
GATAAAGTTA 
TTCACCTGGT 
TCGTTGTAGG 
ACCAATCTTG 
AGAAATTCCA 
CTAAATGGTT 
CAG 



CAACCAGTCC 50 

ATCGGTCTAG 100 

GGGAACCTCT 150 

ACGGCATTAT 2 00 

AC AG AAC AAG 2 50 

CATTTCCAGC 3 00 

TAGAAAAAGT 350 

ACCGGTGAAA 400 

TAAAGAAGGA 450 

GTAACTATGA 500 

ATTGATAATA 550 

CAAAATAGCG 600 

CTACTGGTGG 650 

GTTGG AATGA 7 00 

TTATCTAAAA 7 50 

ATGAATACGC 800 

AGCTCATTTG 850 

CTTTACAGCT 900 

GTGCTATTTA 950 

ATAGTAGGAA 1000 

CATGATCTTA 1050 

ACACAGGAAA 1100 

TCCGGTACGG 115 0 

TTCTACCAAT 1200 

CTGATTTTAT 1250 

ATCCAGTTGG 13 00 

GAAAGAATCT 1350 

CGACAGAATC 1400 

GAGTTGGCGG 1450 

TACTGGAACA 1500 

CACCAAACCA 155 0 

GACATGTATG 1600 

GGATATTG AA 1650 
1683 



2 ) INFORMATION FOR SEQ ID NO: 1036 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1682 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



45 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
50 (B) STRAIN: StrR-03 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1036 

TCAAAAAAGA ATTGGAAACT GCAGAGGTCA AGGGGATTGA TTTTACAACC 50 

5 5 AGTCCCAATC GTAGTTAC CC AAATGGACAA TTTGCTTCTA GTTTTATCGG 100 

" TCTAGCTCAG CTCCATGAAA ATGAAGATGG AAGCAAGAGT TTGCTGGGAA 15 0 

CCTCTGGAAT GGAGAGTTCC TTGAACAGTA TTCTTGCAGG G AC AG AC GGC 200 

ATTATTACCT ATGAAAAGGA TCGTCTGGGC AATATTGTAC CCGGAACAGA 250 

ACAAGTTTCC CAACAAACGG TGGATGGCAA GGATGTTTAC ACAACCATTT 3 00 

60 CCAGCCCCCT CCAGTCCTTC ATGGAAACTC AGATGGATGC CTTTC TAG AA 350 
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AAAGTAAAAG GTAAGTATAT GACCGCGACC TTGGTCAGTG CAAAGACCGG 400 

TGAAATCCTC GCTACCACCC AAC G AC C T AC CTTTAATGCA GATACTAAAG 45 0 

AAGGAATCAC TGAGGACTTT GTTTGGCGTG ATATTCTTTA TCAAAGTAAC 5 00 

TATG AAC C AG GATCAGCCAT GAAGGTTATG ACGTTAGCTT CTTCTATTGA 5 50 

5 TAATAATACC TTCCCAAGTG GAGAATACTT CAATAGCAGT GAATTCAAAA 600 

TAGCGGATGC GACGACTCGA GATTGGGATG TTAATGATGG TTTGACTACT 650 

GGTGGGATGA TGACTTTCTT ACAAGGTTTC GCTCACTCCA GTAATGTTGG 7 00 

AATGAGTCTA CTTGAACAAA AAATGGGAGA TGCTACTTGG TTGGATTATC 7 50 

TAAAACGCTT TAAATTTGGG GTTCCAACTC GCTTTGGCTT GACAGATGAA 800 

10 TACGCTGGTC AAC TTC C AGC TGATAATATT GTTAGTATTG CTCAAAGCTC 850 

ATTTGGGCAA GGAATTTCAG TGACACAAAC ACAAATGCTT CGTGCCTTTA 900 

CAGCTATTGC TAATGATGGA GTTATGCTGG AGCCAAAATT TATAAGTGCT 950 

ATTTATGATA CTAACAATCA GTCTGTACGT AAGTCACAAA AAGAAATAGT 1000 

AGGAAATCCT GTTTCCAAAG AGGCAGCAAG CACAACTCGA AATCACATGA 1050 

15 TCTTAGTTGG GACGGACCCT CTATATGGAA CTATGTATAA TC ACT AC AC A 1100 

GGAAAGCCAA TTATAACAGT TCCTGGACAA AATGTAGCAG TTAAATCCGG 1150 

TACGGCTCAA ATCGCTGATG AGAAAAATGG AGGATAC TTG GTTGGTTCTA 1200 

CCAATTATAT TTTCTCAGTT GTGACTATGA ATCCTGCTGA AAATC CTGAT 1250 

TTTATCTTGT ATGTAACGGT TCAACAGCCT GAGCATTATT CAGGTATCCA 1300 

2 0 GTTGGGAGAA TTTGCCACCC CAATCTTGGA GC GGGCTTC A GCTATGAAAG 1350 

AATCTCTCAA TCTTCAATCT CCAGCCAAAA ATTTAGATAA AGTTACGACA 1400 

GAATCTTCTT ATGCAATGCC TAGCATCAAG GATATTTCAC CTGGTGAGTT 1450 

GGCGG AAGCC TTACGCCGAA ATATTGTGCA ACCAATCGTT GTAGGTACTG 1500 

GAACAAAGAT T AAAG AG AC T TCTGTAGAAG AAGGGACCAA TCTTGCACCA 1550 

25 AACCAACAAG TTCTCCTTTT ATCGGATAAG GTAGAAGAAA TTCCAGACAT 1600 

GTATGGCTGG AAAAAAGAGA CTGCCGAGAC CTTTGCTAAA TGGTTGGATA 1650 

TTGAACTGGA ATTTGAAGGT TCAGGTTCCG TT 1682 



30 



40 



55 



60 



2 ) INFORMATION FOR SEQ ID NO: 1037 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1681 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-04 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 7 



50 



TTTACAACCA 


50 


TTTTATC GG A 


100 


TACTGGGAAC 


150 


ACAGACGGTA 


200 


AGGTACAGAA 


250 


CAACATTGTC 


300 


TTTC TAG AAA 


350 


AAAGACCGGT 


400 


ATACTAAAGA 


450 


CAAAGTAACT 


500 


TTCTATTGAT 


550 


AATTCAAAAT 


600 


TTGACTACTG 


650 


TAATGTTGGA 


700 
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ATGAGTCTAC 
AAAACGCTTT 
ACGCTGGTCA 
TTTGGGCAAG 
5 AGCT ATTGC T 
TTTATGATAC 
GGAAATCCTG 
CTTAGTTGGG 
GAAAGCCAAT 

1 0 ACGGCTC AAA 
CAATTATATT 
TTATCTTGTA 
TTGGGAGAAT 
ATCTCTCAAT 

1 5 AATCTTCTTA 
GCGGAAGCCT 
AACAAAGATT 
ACCAACAAGT 
TATGGCTGGA 

20 TGAACTGGAA 



TTGAACAAAA 

AAATTTGGGG 

ACTTCCAGCT 

GAATTTCAGT 

AATGATGGAG 

TAACAATCAG 

TTTCCAAAGA 

ACGGACCCTC 

TATAACAGTT 

TCGCTGATGA 

TTCTCAGTTG 

TGTAACGGTT 

TTGCCACCCC 

CTTCAATCTC 

TGCAATGCCT 

TACGCCGAAA 

AAAGAGACTT 

TCTCCTTTTA 

AAAAAGAGAC 

TTTGAAGGTT 



AATGGGAGAT 

TTCCAACTCG 

GATAATATTG 

GACACAAACA 

TTATGCTGGA 

TCTGTACGTA 

GGCAGCAAGC 

TATATGGAAC 

CCTGGACAAA 

GAAAAATGGA 

TGACTATGAA 

CAACAGCCTG 

AATCTTGGAG 

CAGCCAAAAA 

AGCATCAAGG 

TATTGTGCAA 

CTGTAGAAGA 

TCGGATAAGG 

TGCCGAGACC 

CAGGTTCCGT 



GCTACTTGGT 

CTTTGGCTTG 

TTAGTATTGC 

CAAATGCTTC 

GCCAAAATTT 

AGTCACAAAA 

ACAACTCGAA 

TATGTATAAT 

ATGTAGCAGT 

GGATACTTGG 

TCCTGCTGAA 

AGCATTATTC 

CGGGCTTCAG 

TTTAGATAAA 

ATATTTC AC C 

CCAATCGTTG 

AGGGACCAAT 

TAGAAGAAAT 

TTTGCTAAAT 



TGGATTATCT 7 50 

ACAGATGAAT 800 

TCAAAGCTCA 850 

GTGCCTTTAC 9 00 

ATAAGTGCTA 950 

AGAAATAGTA 1000 

ATCACATGAT 1050 

C AC TAC AC AG 1100 

TAAATCCGGT 1150 

TTGGTTC TAC 1200 

AATCCTGATT 1250 

AGGTATCCAG 1300 

CTATGAAAGA 1350 

GTTACGACAG 1400 

TGGTGAGTTG 1450 

TAGGTACTGG 15 00 

CTTGCACCAA 1550 

TCCAGACATG 1600 

GGTTGGATAT 1650 
1681 



25 



30 



35 



2 ) INFORMATION FOR SEQ ID NO: 1038 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1685 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1038 



4 0 AAAAAAGAAT 
TCCCAATCGT 
TAGCTCAGCT 
TCTGGAATGG 
TATTACCTAT 

45 TGGTATCGCA 
AGTCCGCTAC 
AGTAAAAGGT 
AAATCCTCGC 
GGAATCACTG 

50 TGAACCAGGA 
AT AATACC TT 
GCGGATGCGA 
TGGGATGATG 
TGAGTCTACT 

55 AAACGC TTT A 
CGCTGGTCAA 
TTGGGCAAGG 
GCTATTGCTA 
TT ATGAT AC T 

60 GAAATCCTGT 



TGGAAACTGC 

AGTTATCCAA 

CCATGAAAAT 

AGAGTTCCTT 

GAAAAAGACC 

AC AAAC TGTG 

AATCTTTCAT 

AAGTATATGA 

TACCACCCAA 

AGGACTTTGT 

TCAGCCATGA 

CCCAAGTGGA 

CGACTCGAGA 

ACTTTC TTAC 

TGAACAAAAA 

AATTTGGGGT 

CTTCCAGCTG 

AATTTCAGTG 

ATGATGGAGT 

AACAATCAGT 

TTCCAAAGAG 



AGAGGTCAAG 
ACGGACAATT 
GAAGATGGCA 
GAACAGTATT 
GTGTAGGAAA 
GATGGCAAGG 
GG AAAC TC AG 
CCGCGACCTT 
CGACCTACCT 
TTGGCGTGAT 
AGGTTATGAC 
GAATACTTCA 
TTGGGATGTT 
AAGGTTTCGC 
ATGGGAGATG 
TCCAACTCGC 
ATAATATTGT 
ACACAAACAC 
TATGCTGGAG 
CTGTACGTAA 
GCAGCAAGCA 



GGGATTGATT 

TGCTTCTAGT 

GCAAGAGCTT 

CTTGCAGGGA 

TATCGTACCA 

ATGTTTATAC 

ATGGATGCCT 

GGTCAGTGCA 

TTAATGCAGA 

ATTCTTTATC 

GTTAGCTTCT 

ATAGCAGTGA 

AATGATGGTT 

TCACTCCAGT 

CTACTTGGTT 

TTTGGC TTG A 

TAGTATTGCT 

AAATGCTTCG 

CCAAAATTTA 

GTCACAAAAA 

CAACTCGAAA 



TTACAACCAG 50 

TTTATCGGAC 100 

ACTGGGAACC 150 

C AG AC GGT AT 2 00 

GGTACAGAAC 2 50 

AACATTGTCT 300 

TTCTAGAAAA 350 

AAG AC CGGTG 400 

TACTAAAGAA 450 

AAAGTAACTA 500 

TCTATTGATA 550 

ATTCAAAATA 600 

TG ACTACTGG 650 

AATGTTGGAA 7 00 

GGATTATCTA 7 50 

CAGATGAATA 800 

CAAAGCTCAT 850 

TGCCTTTACA 900 

T AAGTGC TAT 950 

G AAATAGTAG 1000 

TCACATGATC 1050 
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TTAGTTGGGA CGGACCCTCT ATATGGAACT ATGTATAATC ACTACACAGG 1100 

AAAGCC AATT ATAACAGTTC CTGGACAAAA TGTAGCAGTT AAATCCGGTA 1150 

CGGCTCAAAT CGCTGATGAG AAAAATGGAG GATACTTGGT TGGTTCTACC 12 00 

AATTATATTT TCTCAGTTGT GACTATGAAT CCTGCTGAAA ATCCTGATTT 12 50 

5 TATCTTGTAT GTAACGGTTC AACAGCCTGA GCATTATTCA GGTATCCAGT 13 00 

TGGGAGAATT TGCCACCCCA ATCTTGGAGC GGGCTTCAGC TATGAAAGAA 1350 

TCTCTCAATC TTCAATCTCC AGCCAAAAAT TTAGATAAAG TTACGACAGA 1400 

ATCTTCTTAT GCAATGCCTA GCATCAAGGA TATTTCACCT GGTGAGTTGG 145 0 

CGGAAGCCTT ACGCCGAAAT ATTGTGCAAC CAATCGTTGT AGGTACTGGA 1500 

10 ACAAAGATTA AAGAGACTTC TGTAGAAGAA GGGACCAATC TTGCACCAAA 1550 

CCAACAAGTT CTCCTTTTAT CGGATAAGGT AGAAGAAATT CCAGACATGT 1600 

ATGGCTGGAA AAAAG AG AC T GCCGAGACCT TTGCTAAATG GTTGGATATT 165 0 

GAACTGGAAT TTGAAGGTTC AGGTTCCGTT GTTCA 1685 

15 



2 ) INFORMATION FOR SEQ ID NO: 1039 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1679 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



2 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1039 



AAAAGAATTG GAAACTGCAG AGGTCAAGGG GATTGATTTT ACAACCAGTC 5 0 

CCAATCGTAG TTATCCAAAC GGACAATTTG CTTCTAGTTT TATCGGACTA 100 

3 5 GCTCAGCTCC ATGAAAATGA AGATGGCAGC AAGAGCTTAC TGGGAACCTC 15 0 

TGGAATGGAG AGTTCCTTGA ACAGTATTCT TGCAGGGACA GACGGTATTA 2 00 

TTACCTATGA AAAAGACCGT GTAGGAAATA TCGTACCAGG TACAGAACTG 25 0 

GTATCGCAAC AAAC TGTGG A TGGCAAGGAT GTTTATACAA C ATTGTC TAG 300 

TCCGCTACAA TCTTTCATGG AAACTCAGAT GGATGCCTTT C T AG AAAAAG 350 

40 TAAAAGGTAA GTATATGACC GCGACCTTGG TCAGTGCAAA GACCGGTGAA 400 

ATCCTCGCTA CCACCCAACG ACCTACCTTT AATGCAGATA CTAAAGAAGG 45 0 

AATC AC TG AG G AC TTTGTTT GGCGTGATAT TCTTTATCAA AGTAACTATG 500 

AACCAGGATC AGCCATGAAG GTTATGACGT TAGCTTCTTC TATTGATAAT 550 

AATAC C TTCC CAAGTGGAGA ATACTTCAAT AGCAGTGAAT TCAAAATAGC 600 

45 GGATGCGACG ACTCGAGATT GGGATGTTAA TGATGGTTTG ACTACTGGTG 650 

GGATGATGAC TTTCTTACAA GGTTTCGCTC ACTCCAGTAA TGTTGGAATG 7 00 

AGTCTACTTG AACAAAAAAT GGGAGATGCT AC TTGGTTGG ATTATCTAAA 75 0 

ACGCTTTAAA TTTGGGGTTC CAACTCGCTT TGGCTTGACA GATGAATACG 800 

CTGGTCAACT TCCAGCTGAT AATATTGTTA GTATTGCTCA AAGCTCATTT 850 

50 GGGCAAGGAA TTTCAGTGAC ACAAACACAA ATGCTTCGTG CCTTTACAGC 900 

TATTGCTAAT GATGGAGTTA TGCTGGAGCC AAAATTTATA AGTGCTATTT 950 

ATGATACTAA CAATCAGTCT GTACGTAAGT CACAAAAAGA AATAGTAGGA 1000 

AATCCTGTTT CCAAAGAGGC AGCAAGCACA ACTCGAAATC ACATGATCTT 1050 

AGTTGGGACG GACCCTCTAT ATGGAACTAT GTATAATCAC TACACAGGAA 1100 

55 AGCCAATTAT AACAGTTCCT GGACAAAATG TAGCAGTTAA ATCCGGTACG 1150 

GCTCAAATCG CTGATGAGAA AAATGGAGGA TACTTGGTTG GTTCTACCAA 12 00 

TTATATTTTC TCAGTTGTGA CTATGAATCC TGCTGAAAAT CCTGATTTTA 1250 

TCTTGTATGT AACGGTTCAA CAGCCTGAGC ATTATTCAGG TATCCAGTTG 13 00 

GGAGAATTTG CCACCCCAAT CTTGGAGCGG GCTTCAGCTA TGAAAGAATC 1350 

60 TC TC AATC TT CAATCTCCAG CCAAAAATTT AGATAAAGTT AC G AC AGAAT 1400 
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CTTCTTATGC AATGCCTAGC ATCAAGGATA 
GAAGCCTTAC GCCGAAATAT TGTGCAACCA 
AAAGATTAAA GAGACTTCTG TAGAAGAAGG 
AACAAGTTCT CCTTTTATCG GATAAGGTAG 
5 GGCTGGAAAA AAGAGACTGC CGAGACCTTT 
AC TGG AATTT GAAGGTTCAG GTTCCGTTG 



TTTCACCTGG TGAGTTGGCG 1450 
ATCGTTGTAG GTACTGGAAC 1500 
GACCAATCTT GCACCAAACC 1550 
AAGAAATTCC AGACATGTAT 1600 
GCTAAATGGT TGGATATTGA 1650 

1679 



10 2 ) INFORMATION FOR SEQ ID NO: 1040 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1685 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-07 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1040 



TAAAAAGGAT 
GCCCTAATAG 
TTGGCCCAAC 
TTCTGGGATG 
TTATTAC CTA 
CAAGTTTCCC 
TAGTCCGCTA 
AAGTAAAAGG 
GAAATCCTCG 
AGGAATCACT 
ATGAACCAGG 
AATAATACCT 
AGCGGATGCG 
GTGGGATGAT 
ATGAGTC TAC 
AAAACGC TTT 
ACGCTGGTCA 
TTTGGGCAAG 
AGCTATTGCT 
TTTATGATAC 
GGAAATCCTG 
CTTAGTTGGG 
GAAAGCCAAT 
ACGGCTCAAA 
CAATTATATT 
TTATCTTGTA 
TTGGGAGAAT 
ATCTCTCAAT 
AATCTTCTTA 
GCGGAAGCCT 
AACAAAGATT 
ACCAACAAGT 
TATGGC TGGA 
TGAACTGGAA 



CTAAAAGACG C TAGTGTTG A AGGAATTGAC 
AAGCTATCCA AATGGACAAT TCGCTTCTAG 
TCCATGAAAA TGAGGATGGT AGCAAGAGTT 
GAGAGTTCCT TGAACAGTAT TCTTGCAGGG 
TGAAAAAGAT CGTCTGGGTA ATATTGTCCC 
-AACAAACGGT-AGATGG€AAG G ATGTTT ATA 
CAATCTTTCA TGGAAACTCA GATGGATGCC 
TAAGTATATG ACCGCGACCT TGGTCAGTGC 
CTACCACCCA ACGACCTACC TTTAATGCAG 
GAGGACTTTG TTTGGCGTGA TATTCTTTAT 
ATCAGCCATG AAGGTTATGA CGTTAGCTTC 
TCCCAAGTGG AGAATACTTC AATAGCAGTG 
ACGACTCGAG ATTGGGATGT TAATGAAGGT 
GACTTTCTTA CAAGGTTTCG CTCACTCCAG 
TTGAACAAAA AATGGGAGAT GCTACTTGGT 
AAATTTGGGG TTCCAACTCG CTTTGGCTTG 
ACTTCCAGCT GATAATATTG TTAGTATTGC 
GAATTTCAGT GACACAAACA CAAATGCTTC 
AATGATGGAG TTATGCTGGA GCCAAAATTT 
TAACAATCAG TCTGTACGTA AGTCACAAAA 
TTTCCAAAGA GGCAGCAAGC ACAACTCGAA 
ACGGACCCTC TATATGGAAC TATGTATAAT 
TATAACAGTT CC TGG AC AAA ATGTAGCAGT 
TCGCTGATGA GAAAAATGGA GGATACTTGG 
TTCTCAGCTG TGACTATGAA TCCTGCTGAA 
TGTAACGGTT CAACAGCCTG AGCATTATTC 
TTGCCACCCC AATCTTGGAG CGGGC TTC AG 
CTTCAATCTC CAGCCAAAAA TTTAGATAAA 
TGCAATGCCT AGCATCAAGG AT ATTTC AC C 
TACGCCGAAA TATTGTGCAA CCAATCGTTG 
AAAGAGACTT CTGTAGAAGA AGGGACCAAT 
TCTCCTTTTA TCGGATAAGG TAGAAGAAAT 
AAAAAGAGAC TGCCGAGACC TTTGCTAAAT 
TTTGAAGGTT CAGGTTCCGT CGTTC 

788 



TTCACAACTA 50 

TTTTATTGGT 100 

TGCTGGGAAC 150 

AAAGACGGTA 200 

TGGAACAGAA 250 

CAACATTGTC 3 00 

TTTC TAG AAA 350 

AAAGACCGGT 400 

ATACTAAAGA 450 

CAAAGTAACT 500 

TTCTATTGAT 550 

AATTCAAAAT 600 

TTGACTACTG 650 

TAATGTTGGA 700 

TGGATTATCT 750 

ACAGATGAAT 800 

TCAAAGCTCA 850 

GTGCCTTTAC 900 

ATAAGTGCTA 950 

AGAAATAGTA 1000 

ATC ACATGAT 1050 

CACTACACAG 1100 

TAAATCCGGT 1150 

TTGGTTCTAC 12 00 

AATCCTGATT 1250 

AGGTATCCAG 1300 

CTATGAAAGA 13 50 

GTTACG ACAG 1400 

TGGTGAGTTG 1450 

TAGGTACTGG 1500 

CTTGC ACCAA 1550 

TCCAGACATG 1600 

GGTTGGATAT 1650 
1685 
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2 ) INFORMATION FOR SEQ ID NO : 1041 



5 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1696 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



10 



(ii) 



MOLECULE TYPE: Genomic DNA 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-08 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1041 

GTCTATCAAA AAAGAATTGG AAACTGCAGA GGTCAAGGGG ATTGATTTTA 
2 0 CAACCAGTCC CAATCGTAGT TATCCAAACG GACAATTTGC TTCTAGTTTT 
ATCGGACTAG CTCAGCTCCA TGAAAATGAA GATGGCAGCA AGAGCTTACT 
GGGAACCTCT GGAATGGAGA GTTCCTTGAA CAGTATTCTT GC AGGG AC AG 
ACGGTATTAT TACCTATGAA AAAGACCGTG TAGGAAATAT CGTACCAGGT 
AC AGAAC TGG TATCGCAACA AACTGTGGAT GGCAAGGATG TTTATACAAC 

2 5 ATTGTCTAGT CCGCTACAAT CTTTCATGGA AACTCAGATG GATGCCTTTC 

TAGAAAAAGT AAAAGGTAAG TATATG AC C G CGACCTTGGT CAGTGCAAAG 
ACCGGTGAAA TCCTCGCTAC CACCCAACGA CCTACCTTTA ATGCAGATAC 
TAAAGAAGGA ATCACTGAGG ACTTTGTTTG GCGTGATATT CTTTATCAAA 
GTAACTATGA ACCAGGATCA GCCATGAAGG TTATGACGTT AGCTTCTTCT 

3 0 ATTGATAATA ATACCTTCCC AAGTGGAGAA TACTTCAATA GCAGTGAATT 

CAAAATAGCG GATGCGACGA CTCGAGATTG GGATGTTAAT GATGGTTTGA 
CTACTGGTGG GATGATGACT TTCTTACAAG GTTTCGCTCA CTCCAGTAAT 
GTTGGAATGA GTCTACTTGA ACAAAAAATG GGAGATGCTA CTTGGTTGGA 
TTATCTAAAA CGCTTTAAAT TTGGGGTTCC AACTCGCTTT GGCTTGACAG 

3 5 ATGAATACGC TGGTCAACTT CCAGCTGATA ATATTGTTAG TATTGCTCAA 
AGCTCATTTG GGCAAGGAAT TTCAGTGACA CAAACACAAA TGCTTCGTGC 
CTTTACAGCT ATTGCTAATG ATGGAGTTAT GCTGGAGCCA AAATTTATAA 
GTGCTATTTA TGATAC TAAC AATCAGTCTG TACGTAAGTC ACAAAAAGAA 
ATAGTAGGAA ATCCTGTTTC CAAAGAGGCA GCAAGCACAA CTCGAAATCA 

40 CATGATCTTA GTTGGGACGG ACCCTCTATA TGGAACTATG TATAATCACT 
ACACAGGAAA GCCAATTATA ACAGTTCCTG GACAAAATGT AGCAGTTAAA 
TCCGGTACGG CTCAAATCGC TGATGAGAAA AATGGAGGAT ACTTGGTTGG 
TTCTACCAAT TATATTTTCT CAGTTGTGAC TATGAATCCT GCTGAAAATC 
CTGATTTTAT CTTGTATGTA ACGGTTCAAC AGCCTGAGCA TTATTCAGGT 

45 ATCCAGTTGG GAGAATTTGC CACCCCAATC TTGGAGCGGG CTTCAGCTAT 
GAAAGAATCT CTCAATCTTC AATCTCCAGC CAAAAATTTA GATAAAGTTA 
CGACAGAATC TTCTTATGCA ATGC CTAGC A TCAAGGATAT TTCACCTGGT 
GAGTTGGCGG AAGCCTTACG CCGAAATATT GTGCAACCAA TCGTTGTAGG 
TACTGGAACA AAGATTAAAG AGACTTCTGT AGAAGAAGGG AC C AATCTTG 

50 CACCAAACCA AC AAGTTC TC CTTTTATCGG ATAAGGTAGA AGAAATTCCA 
GACATGTATG GCTGGAAAAA AG AG AC TGC C GAGACCTTTG CTAAATGGTT 
GGATATTGAA C TGG AATTTG AAGGTTCAGG TTCCGTTGTT CAGAAG 



55 

2) INFORMATION FOR SEQ ID NO: 1042 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1300 
1350 
1400 
1450 
1500 
1550 
1600 
1650 
1696 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1689 bases 

(B) TYPE: Nucleic acid 



789 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 



(ii) 



MOLECULE TYPE: Genomic DNA 



5 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-09 



10 



(xi) 



SEQUENCE DESCRIPTION: SEQ 



ID NO: 1042 



AAAAAAGAGT TGGAAACTGC AGAGGTCAAG GGGATTGATT TTACAACCAG 
TCCCAATCGT AGTTACCCAA ACGGACAATT TGCTTCTAGT TTTATCGGAC 
TAGCTCAGCT CCATGAAAAT GAAGATGGCA GCAAGAGCTT GCTGGGAACC 

15 TCTGGAATGG AGAGTTCCTT GAACAGTATT CTTGCAGGGA CAGACGGCAT 
TATTACCTAT GAAAAGGATC GTCTGGGAAA TATTGTCCCC GGAACGGAAC 
AAGTTTCCCA ACAAACTGTA GATGGCAAGG ATGTTTATAC G ACT ATTTC C 
AGCACCCTTC AGTCC TTC AT GGAAACTCAG ATGGATGCCT TTCTAGAAAA 
AGTAAAAGGT AAGTATATGA CCGCGACCTT GGTCAGTGCA AAGACCGGTG 

20 AAATCCTCGC TACCACCCAA CGACCTACCT TTAATGCAGA TACTAAAGAA 
GGAATCACTG AGGACTTTGT TTGGCGTGAT ATTCTTTATC AAAGTAACTA 
TGAACCAGGA TCAGCCATGA AGGTTATGAC GTTAGCTTCT TC TATTG AT A 
ATAATAC CTT CCCAAGTGGA GAATACTTCA ATAGCAGTGA ATTCAAAATA 
GCGGATGCGA CGACTCGAGA TTGGGATGTT AATGATGGTT TGACTACTGG 

25 TGGGATGATG AC TTTCTT AC AAGGTTTCGC TCACTCCAGT AATGTTGGAA 
TGAGTCTACT TGAACAAAAA ATGGGAGATG CTACTTGGTT GGATTATCTA 
AAACGCTTTA AATTTGGGGT TCCAACTCGC TTTGGC TTG A CAGATGAATA 
CGCTGGTCAA CTTCCAGCTG ATAATATTGT TAGTATTGCT CAAAGCTCAT 
TTGGGCAAGG AATTTCAGTG ACACAAACAC AAATGCTTCG TGCCTTTACA 

30 GCTATTGCTA ATGATGGAGT TATGCTGGAG CCAAAATTTA TAAGTGCTAT 
TTATGAT AC T AACAATCAGT CTGTACGTAA GTCACAAAAA GAAATAGTAG 
GAAATCCTGT TTCCAAAGAG GCAGCAAGCA CAACTCGAAA TCACATGATC 
TTAGTTGGGA CGGACCCTCT ATATGGAACT ATGTATAATC ACTACACAGG 
AAAGCCAATT ATAACAGTTC CTGGACAAAA TGTAGCAGTT AAATCCGGTA 

35 CGGCTCAAAT CGCTGATGAG AAAAATGGAG GATACTTGGT TGGTTCTACC 
AATTATATTT TCTCAGTTGT G AC T ATG AAT CCTGCTGAAA ATCCTGATTT 
TATCTTGTAT GTAACGGTTC AACAGCCTGA GCATTATTCA GGTATCCAGT 
TGGGAGAATT TGCCACCCCA ATCTTGGAGC GGGCTTCAGC TATGAAAGAC 
TCTCTCAATC TTCAAACAAC AGCTAAAGCT TTGGAGCAAG TAAGTCAACA 

40 AAGTCCTTAT CCTATGCCTA GTGTCAAGGA TATTTCACCT GGTGATTTAG 
CAGAAGAATT GCGTCGCAAT CTTGTACAAC CCATCGTTGT GGGAACAGGA 
ACGAAGATTA AAAACAGTTC TGCTGAAGAA GGGAAGAATC TTGCCCCGAA 
TCAGCAAGTC CTTATC TTAT CTGATAAAGT AGAGGAAGTT CCAGATATGT 
ATGGTTGGAC AAAGGAG AC T GCTGAGACCC TTGCTAAGTG GC TC AAT ATA 

45 GAACTTGAAT TTCAAGGTTC GGGTTCTACT GTGCAGAAG 



2) INFORMATION FOR SEQ ID NO: 1043 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1300 
1350 
1400 
1450 
1500 
1550 
1600 
1650 
1689 



50 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1690 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



(ii) 



MOLECULE TYPE: Genomic DNA 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

790 
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( B) STRAIN: StrR-10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1043 

5 CAAAAAAGAG TTGGAAACTG CAGAGGTCAA GGGGATTGAT TTTACAACCA 50 

GTCCTAATCG TAGTTACCCA AACGGACAAT TTGCTTCTAG TTTTATCGGT 100 

CTAGCTCAAC TCCATGAAAA TGAAGATGGA AGCAAGAGTT TGCTGGGAAC 150 

TTCTGGAATG GAGAGTTCCT TGAACAGTAT TCTTGCAGGG AAAGACGGTA 2 00 

TTATTACTTA TGAAAAGGAT CGTCTGGGTA ATATTGTCCC TGGAACAGAA 2 50 

10 CAAGTTTCCC AACAAACGGT AGATGGCAAG GATGTTTATA CAACATTGTC 300 

TAGTCCGCTC CAATCTTTCA TGGAAACTCA GATGGATGCC TTTCTAGAAA 350 

AAGTAAAAGG TAAGTATATG ACCGCGACCT TGGTCAGTGC AAAGACCGGT 400 

GAAATCCTCG CTACCACCCA ACGACCTACC TTTAATGCAG ATACTAAAGA 450 

AGGAATCACT GAGGACTTTG TTTGGCGTGA TATTCTTTAT CAAAGTAACT 5 00 

15 ATGAACCAGG ATCAGCCATG AAGGTTATGA CGTTAGCTTC TTCTATTGAT 550 

AATAATACCT TCCCAAGTGG AG AAT AC TTC AATAGCAGTG AATTCAAAAT 600 

AGCGGATGCG ACGACTCGAG ATTGGGATGT TAATGATGGT TTGACTACTG 650 

GTGGGATGAT GAC TTTCTTA CAAGGTTTCG CTCACTCCAG TAATGTTGGA 7 00 

ATGAGTCTAC TTGAACAAAA AATGGGAGAT GCTACTTGGT TGGATTATCT 750 

20 AAAACGCTTT AAATTTGGGG TTCCAACTCG CTTTGGCTTG ACAGATGAAT 800 

ACGCTGGTCA ACTTCCAGCT GATAATATTG TTAGTATTGC TCAAAGCTCA 850 

TTTGGGCAAG GAATTTCAGT GACACAAACA C AAATGC TTC GTGCC TTT AC 900 

AGCTATTGCT AATGATGGAG TTATGCTGGA GCCAAAATTT AT AAGTGC T A 950 

TTTATGATAC TAACAATCAG TCTGTACGTA AGTCACAAAA AGAAATAGTA 1000 

25 GGAAATCCTG TTTCCAAAGA GGCAGCAAGC ACAACTCGAA ATCACATGAT 1050 

CTTAGTTGGG ACGGACCCTC TATATGGAAC TATGTATAAT CACTACACAG llOO 

GAAAGCCAAT TATAACAGTT CCTGGACAAA ATGTAGCAGT TAAATCCGGT 1150 

ACGGCTCAAA TCGCTGATGA GAAAAATGGA GGATACTTGG TTGGTTCTAC 12 00 

CAATTATATT TTCTCAGTTG TGACTATGAA TCCTGCTGAA AATCCTGATT 1250 

30 TTATCTTGTA TGTAACGGTT CAACAGCCTG AGCATTATTC AGGTATC C AG 13 00 

TTGGGAGAAT TTGCCACCCC AATCTTGGAG CGGGCTTCAG C T ATGAAAG A 1350 

ATCTCTCAAT CTTCAATCTC CAGCCAAAAA TTTAGATAAA GTTACGACAG 1400 

AATCTTCTTA TGCAATGCCT AGCATCAAGG ATATTTC AC C TGGTGAGTTG 1450 

GCGGAAGCCT TACGCCGAAA TATTGTGCAA CCAATCGTTG TAGGTACTGG 1500 

35 AACAAAGATT AAAGAGACTT CTGTAGAAGA AGGGACCAAT CTTGCACCAA 1550 

ACCAACAAGT TCTCCTTTTA TCGGATAAGG TAGAAGAAAT TCCAGACATG 1600 

TATGGCTGGA AAAAAGAGAC TGCCGAGACC TTTG CTAAAT GGTTGGATAT 1650 

TGAACTGGAA TTTGAAGGTT CAGGTTCCGT TGTTCAGAAG 1690 

40 



2) INFORMATION FOR SEQ ID NO: 1044 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 1668 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 



55 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1044 



AGTTGGAAAC TGCAGAGGTC AAGGGGATTG ATTTTACAAC CAGTCCTAAT 50 

CGTAGTTACC CAAACGGACA ATTTGC TTC T AGTTTTATCG GACTAGCTCA 100 

60 GCTCCATGAA AATGAAGATG GCAGCAAGAG CTTGCTGGGA ACCTCTGGGA 150 

791 
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TGGAGAGTTC 
TATGAAAAGG 
CCAACAAACG 
TTCAGTCCTT 
5 GGCAAGTATA 
TGCAACAACG 
CCAAGGACTT 
GGGTCAACCA 
TTTCCCAGGA 

10 CGACAATTCG 
ATG AC TTTCT 
ACTTGAACAA 
TTAAGTTTGG 
CAATTGCCTG 

15 GGGGATTTCA 
CTAATGATGG 
CCAAATGATC 
TGTTTCTAAA 
GGACGGATCC 

2 0 ACTGTAACTG 
GATTGCTGAC 
TTTTCTCGGC 
TATGTAACGG 
ATTTGCCACC 

2 5 ATCTTCAATC 

TATGCAATGC 
CTTACGCCGA 
TTAAAGAGAC 
GTTCTCCTTT 

3 0 G AAAAAAG AG 

AATTTGAAGG 



TTTAAATAGA 
ATCGTCTGGG 
GTAGATGGCA 
CATGGAGACC 
TGACGGCTAC 
CAACGGCCGA 
TGTTTGGCGT 
TGAAGGTCAT 
GGAGAAGTTT 
AGATTGGGAT 
TACAAGGTTT 
AAAATGGGAG 
GGTGCCGACG 
CAGATAATAT 
GTGACCCAGA 
TGTTATGCTG 
AAACTGCTCG 
GATGCAGCTA 
GGTTTATGGA 
TTCCTGGGCA 
GAGAAAAATG 
TGTATCGATG 
TACAGCAACC 
CCAATCTTGG 
TCCAGCCAAG 
CTAGCATCAA 
AATATTGTGC 
TTCTGTAGAA 
TATCGGATAA 
ACTGC TGAAA 
GTCAGGTT 



ATTCTTGCAG 
AAATATTGTC 
AGGATGTTTA 
CAGATGAATG 
CTTGGTCAGT 
CCTTCGATGC 
GATATCCTCT 
GACGCTTGCT 
TCAATAGTAG 
GTTAATGATG 
CGCTCACTCC 
ATGC TACTTG 
CGTTTTGGTC 
TGTTAATATT 
CGCAAATGAT 
GAGCCTAAAT 
GAAATCTCAA 
GTCTAACTCG 
ACCATGTATA 
AAATGTAGCC 
GTGGTTATCT 
AATCCGGCTG 
TGAACATTAT 
AGCGGGCTTC 
AATTTAGATA 
GGATATTTCA 
AACCAATCGT 
GAAGGGACCA 
GGTAGAAGAA 
CCTTTGCTAA 



GGACAGACGG 
CCCGGAACGG 
TACGACTATT 
CCTTTCAAGA 
GCTAAAACAG 
TGATACTAAG 
ATCAAAGTAA 
GCTGCTATTG 
TGAATTAAAA 
GTTTGACGAC 
AGTAATGTTG 
GTTGGATTAT 
TGACTGATGA 
GCCATGAGTG 
TCGTGCCTTT 
TTATTAGTGC 
AAAGAAATTG 
GACTAACATG 
ACCACAGCAC 
CTCAAGTCTG 
AGTCGGGTTA 
AAAATCCTGA 
TCAGGTATCC 
AGCTATGAAA 
AAGTTACGAC 
CCTGGTGAGT 
TGTAGGTACT 
ATCTTGCACC 
ATTCCAGACA 
ATGGTTGGAT 



C ATTATT AC C 200 

AACAAGTTTC 2 50 

TCCAGCACCC 300 

AAAAGTAAAA 350 

GGGAAATTCT 400 

GAAGGACTTA 450 

CTATGAGCCA 500 

ATAATAACAC 550 

ATAGCGGATG 600 

TGGTGGGATG 650 

GAATGAGTCT 700 

CTAAACCGCT 750 

GTATTCAGGT 800 

CATTTGGTCA 850 

AC AGC TATTG 900 

CATTTATGAT 95 q 

TGGGAAATCC 1000 

ATTTTGGTAG 1050 

AGGTAAGCC A 1100 

GTACGGCTCA 1150 

ACC AACTATA 1200 

TTTTATCTTG 125 0 

AGTTGGGAGA 13 00 

GAATCTCTCA 135 0 

AGAATCTTCT 1400 

TGGCGGAAGC 1450 

GGAAC AAAGA 1500 

AAACCAACAA 1550 

TGTATGGCTG 1600 

ATTGAGCTGG 1650 
1668 



35 2 ) INFORMATION FOR SEQ ID NO: 1045 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1680 bases 

(B) TYPE: Nucleic acid 
40 (C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

( B ) STRAIN : S trR- 1 2 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1045 



TTGGAAACTG CAGAGGTCAA GGGGATTGAT TTTACAACCA GTCCCAATCG 50 

TAGTTACCCA AATGGACAAT TTGC TTC TAG TTTTATCGGT CTAGCTCAGC 100 

TCCATGAAAA TGAAGATGGA AGCAAGAGTT TGCTGGGAAC CTCTGGAATG 150 

G AG AGTTCC T TGAACAGTAT tCTTGCAGGG ACAGACGGCA TTATTACCTA 2 00 

55 TGAAAAGGAT CGTCTGGGCA ATATTGTACC CGGAACAGAA CAAGTTTCCC 250 

AACAAACGGT GGATGGCAAG GATGTTTACA CAACCATTTC CAGCCCCCTC 3 00 

CAGTCCTTCA TGGAAAC TC A GATGGATGCC TTTCTAGAAA AAGTAAAAGG 35 0 

TAAGTATATG ACCGCGACCT TGGTCAGTGC AAAGACCGGT GAAATCCTCG 400 

CTACCACCCA ACGACCTACC TTTAATGCAG ATACTAAAGA AGGAATCACT 45 0 

60 GAGGACTTTG TTTGGCGTGA TATTCTTTAT CAAAGTAACT ATGAACCAGG 5 00 



792 
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ATCAGCCATG 
TCCCAAGTGG 
ACGACTCGAG 
GACTTTCTTA 
5 TTGAACAAAA 
AAATTTGGGG 
ACTTCCAGCT 
GAATTTCAGT 
AATGATGGAG 

10 TAACAATCAG 
TTTCCAAAGA 
ACGGACCCTC 
TATAACAGTT 
TCGCTGATGA 

15 TTCTCAGTTG 
TGTAACGGTT 
TTGCCACCCC 
CTTCAATCTC 
TGCAATGCCT 

20 TACGCCGAAA 
AAAGAGACTT 
TCTCCTTTTA 
AAAAAGAGAC 
TTTGAAGGTT 

25 



AAGGTTATGA 
AG AAT AC TTC 
ATTGGGATGT 
CAAGGTTTCG 
AATGGGAGAT 
TTCCAACTCG 
GATAATATTG 
GACACAAACA 
TTATGC TGG A 
TCTGTACGTA 
GGCAGCAAGC 
TATATGGAAC 
CCTGGACAAA 
GAAAAATGGA 
TGACTATGAA 
CAACAGCCTG 
AATC TTGG AG 
CAGCCAAAAA 
AGCATCAAGG 
TATTGTGCAA 
CTGTAGAAGA 
TCGGATAAGG 
TGCCGAGACC 
CAGGTTCCGT 



CGTTAGCTTC 
AATAGCAGTG 
TAATGATGGT 
CTCACTCCAG 
GCTACTTGGT 
CTTTGGCTTG 
TTAGTATTGC 
CAAATGCTTC 
GCCAAAATTT 
AGTCACAAAA 
ACAACTCGAA 
TATGTATAAT 
ATGTAGCAGT 
GGATACTTGG 
TCCTGCTGAA 
AGCATTATTC 
CGGGCTTCAG 
TTTAGATAAA 
ATATTTCACC 
CCAATCGTTG 
AGGGACCAAT 
TAGAAGAAAT 
TTTGC TAAAT 
TGTTCAGAAG 



TTCTATTGAT 
AATTCAAAAT 
TTGACTACTG 
TAATGTTGGA 
TGGATTATCT 
ACAGATGAAT 
TCAAAGCTCA 
GTGCCTTTAC 
ATAAGTGC T A 
AGAAATAGTA 
ATCACATGAT 
CACTACACAG 
TAAATCCGGT 
TTGGTTCTAC 
AATCCTGATT 
AGGTATCCAG 
CTATGAAAGA 
GTTACGACAG 
TGGTGAGTTG 
TAGGTACTGG 
CTTGCACCAA 
TCCAGACATG 
GGTTGGATAT 



AATAATACCT 550 

AGCGGATGCG 600 

GTGGGATGAT 65 0 

ATGAGTCTAC 7 00 

AAAACGCTTT 750 

ACGCTGGTCA 800 

TTTGGGCAAG 850 

AGCTATTGCT 900 

TTTATGATAC 950 

GGAAATCCTG 1000 

CTTAGTTGGG 1050 

GAAAGCCAAT 1100 

GCGGCTCAAA 1150 

C AATTATATT 12 00 

TTATCTTGTA 12 5 0 

TTGGGAGAAT 13 00 

ATCTCTCAAT 1350 

AATCTTCTTA 1400 

GCGGAAGCCT 145 0 

AACAAAGATT 1500 

ACCAACAAGT 1550 

T ATGGC TGG A 1600 

TGAACTGGAA 1650 
1680 



2 ) INFORMATION FOR SEQ ID NO: 1046 

3 0 (i) SEQUENCE CHARACTERISTICS: 

... ._ _ (A ) LENGTH: 1689 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
40 (B) STRAIN: StrR-13 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1046 

AAAAAAGAGT TGGAAACTGC AGAGGTCAAG GGGATTGATT TTACAACCAG 50 

45 TCCTAATCGT AGTTACCCAA ACGGACAATT TGCTTCTAGT TTTATC GGTC 100 

TAGCTCAACT C C ATGAAAAT GAAGATGGAA GCAAGAGTTT GCTGGGAACT 15 0 

TCTGGAATGG AGAGTTCCTT GAACAGTATT CTTGCAGGGA AAGACGGTAT 2 00 

TATTACTTAT GAAAAGGATC GTCTGGGTAA TATTGTCCCT GGAACAGAAC 25 0 

AAGTTTCCCA ACAAACGGTA GATGGCAAGG ATGTTTATAC AACATTGTCT 3 00 

50 AGTCCGCTCC AATC TTTCAT GGAAACTCAG ATGGATGCCT TTC TAG AAAA 35 0 

AGTAAAAGGT AAGTATATGA CCGCGACCTT GGTCAGTGCA AAGACCGGTG 400 

AAATCCTCGC TACCACCCAA CGACCTACCT TTAATGCAGA TACTAAAGAA 45 0 

GG AATC AC TG AGGACTTTGT TTGGCGTGAT ATTCTTTATC AAAGTAACTA 500 

TGAACCAGGA TCAGCCATGA AGGTTATGAC GTTAGCTTCT TCTATTGATA 550 

5 5 ATAATACCTT CCCAAGTGGA GAATACTTCA ATAGCAGTGA ATTCAAAATA 600 

GCGGATGCGA CGACTCGAGA TTGGGATGTT AATGATGGTT TGACTACTGG 650 

TGGGATGATG ACTTTCTTAC AAGGTTTCGC TCACTCCAGT AATGTTGGAA 7 00 

TGAGTCTACT TGAACAAAAA ATGGGAGATG CTACTTGGTT GG ATTATC T A 750 

AAACGC TTT A AATTTGGGGT TCCAACTCGC TTTGGCTTGA CAGATGAATA 800 

60 CGCTGGTCAA CTTCCAGCTG ATAATATTGT TAGTATTGCT CAAAGCTCAT 850 

793 
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TTGGGCAAGG AATTTCAGTG ACACAAACAC 
GCTATTGCTA ATGATGGAGT TATGCTGGAG 
TTATGATACT AACAATCAGT CTGTACGTAA 
GAAATCCTGT TTCCAAAGAG GCAGCAAGCA 
5 TTAGTTGGGA CGGACCCTCT ATATGGAACT 
AAAGCCAATT ATAACAGTTC CTGGACAAAA 
CGGCTCAAAT CGCTGATGAG AAAAATGGAG 
AATTATATTT TCTCAGTTGT GACTATGAAT 
TATCTTGTAT GTAACGGTTC AACAGCCTGA 

10 TGGGAGAATT TGCCACCCCA ATCTTGGAGC 
TCTCTCAATC TTCAATCTCC AGCCAAAAAT 
ATCTTCTTAT GCAATGCCTA GCATCAAGGA 
CGGAAGCCTT ACGCCGAAAT ATTGTGCAAC 
ACAAAGATTA AAGAGACTTC TGTAGAAGAA 

15 CCAACAAGTT CTC CTTTTAT CGGATAAGGT 
ATGGCTGGAA AAAAGAGACT GCCGAGACCT 
GAACTGGAAT TTGAAGGTTC AGGTTCCGTT 



AAATGCTTCG TGCCTTTACA 900 

CCAAAATTTA TAAGTGCTAT 950 

GTCACAAAAA GAAATAGTAG 1000 

CAACTCGAAA TCACATGATC 1050 

ATGTATAATC ACTACACAGG 1100 

TGTAGCAGTT AAATCCGGTA 115 0 

GATAC TTGGT TGGTTCTACC 1200 

CCTGCTGAAA ATCCTGATTT 1250 

GCATTATTCA GGTATCCAGT 1300 

GGGCTTCAGC TATGAAAGAA 1350 

TTAGATAAAG TTACGACAGA 1400 

TATTTCACCT GGTGAGTTGG 1450 

CAATCGTTGT AGGTAC TGG A 1500 

GGGACCAATC TTGCACCAAA 1550 

AGAAGAAATT CCAGACATGT 1600 

TTGCTAAATG GTTGGATATT 1650 

GTTCAGAAG 1689 



20 



2 ) INFORMATION FOR SEQ ID NO: 1047 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1690 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



40 



45 



50 



55 



60 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-14 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1047 



TTTACAACCA 


50 


TTTTATCGGA 


100 


TGCTGGGAAC 


150 


ACAGACGGCA 


200 


CGGAACAGAA 


250 


CAACCATTTC 


300 


TTTCAAGAGA 


350 


TAAAACAGGG 


400 


ATACAAAAGA 


450 


CAAAGTAACT 


500 


TGCTATTGAT 


550 


AGTTAAAAAT 


600 


TTGACTGGTG 


650 


TAACGTTGGG 


700 


TTGATTATCT 


750 


ACGGATGAAT 


800 


TCAGAGTTCA 


850 


GTGCCTTCAC 


900 


ATCAGTGCCA 


950 


AGAAGTTGTG 


1000 


CGCATATGGT 


1050 


CATAAGACAG 


1100 


CAAGTCTGGT 


1150 


TTGGT AC G AC 


1200 



794 
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CAATTACATT TTTTCGGCTG 
TTATTCTCTA TGTGACGGTT 
TTGGGGGAAT TTGCCAATCC 
TTCCCTTAAC CTCCAATCTA 
5 AAAGCGCTTA TGCCATGCCT 
GCGGAAGCCT TACGTCGCAA 
AACAAAAATT AAAGAATCAT 
ACCAGCAAGT TCTTCTCTTA 
TATGGTTGGA CAAAAGAGAC 
10 AGAACTTGAA TTTGAAGGTT 



TATCGATGAA CCCTGCTGAA 
CAACAGCCTG AGCATTATTC 
TATCTTGGAA AGGGCAGTGG 
CCGCTAAAAC GTTAAATCAG 
AGCATCAAGG ACATTTCACC 
TATTGTGCAA CCAATCGTTG 
C TGT AG AAG A AGGGACGGAT 
TCTGATAAAG CAGAGGAAGT 
TGCTGAGACC TTTGCTAAGT 
CGGGCTCTAC TGTGCAGAAG 



AATCCTGATT 12 50 

AGGTATTC AG 13 00 

CTATGAAAGA 13 5 0 

GTAACCAATC 1400 

TGGCGATTTG 1450 

TAGGAACAGG 15 00 

CTTGCACCTA 1550 

TCCAGATATG 1600 

GGCTCAATAT 1650 
1690 



15 



2 ) INFORMATION FOR SEQ ID NO : 1048 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1682 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
2 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1048 

30 AAAAGAGTTG GAAAC TGC AG AGGTCAAGGG GATTGATTTT ACAACCAGTC 50 

CCAATCGTAG TTACCCAAAC GGACAATTTG CTTCTAGCTT TATTGGCTTA 100 

GCCCAACTTC ATGAAAATGA GGATGGTAGT AAGAGTTTGT TAGGGACTTC 15 0 

TGGTTTGGAG AGTTC TTTAA ATACCATTCT TGCTGGGACA GACGGTATTA 20 0 

TTACCTATGA AAAAGACCGT GTAGGAAATA TCGTACCAGG TACAGAACTG 25 0 

35 GTATCGCAAC AAACTGTGGA TGGCAAGGAT GTTTATACAA C ATTGTC TAG 3 00 

TCCGCTACAA TCTTTCATGG AAACTCAGAT GGATGCCTTT CTAGAAAAAG 35 0 

TAAAAGGTAA GTATATGACC GCGACCTTGG TCAGTGCAAA GACCGGTGAA 400 

ATCCTCGCTA CCACCCAACG ACCTACCTTT AATGCAGATA CTAAAGAAGG 450 

AATCACTGAG GACTTTGTTT GGCGTGATAT TCTTTATCAA AGTAACTATG 500 

40 AACCAGGATC AGCCATGAAG GTTATGACGT TAGCTTCTTC TATTGATAAT 550 

AATACCTTCC CAAGTGGAGA ATACTTCAAT AGCAGTGAAT TCAAAATAGC 600 

GGATGCGACG ACTCGAGATT GGGATGTTAA TGAAGGTTTG ACTACTGGTG 650 

GGATGATGAC TTTCTTACAA GGTTTCG C TC ACTCCAGTAA TGTTGGAATG 700 

AG TCTACTTG AACAAAAAAT GGGAGATGCT ACTTGGTTGG ATT ATC T AAA 75 0 

45 ACGCTTTAAA TTTGGGGTTC CAACTCGCTT TGGCTTGACA GATGAATACG 800 

CTGGTCAACT TCCAGCTGAT AATATTGTTA GTATTGCTCA AAGCTCATTT 850 

GGGCAAGGAA TTTCAGTGAC ACAAACACAA ATGCTTCGTG CCTTTACAGC 900 

TATTGCTAAT GATGGAGTTA TGCTGGAGCC AAAATTTATA AGTGCTATTT 950 

ATGATACTAA CAATCAGTCT GTACGTAAGT CACAAAAAGA AATAGTAGGA 1000 

50 AATCCTGTTT CCAAAGAGGC AGCAAGCACA AC TCGAAATC ACATGATCTT 1050 

AGTTGGGACG GACCCTCTAT ATGG AAC TAT GTATAATCAC TACACAGGAA 1100 

AGCCAATTAT AACAGTTCCT GGACAAAATG TAGCAGTTAA ATCCGGTACG 115 0 

GCTCAAATCG CTGATGAGAA AAATGGAGGA TACTTGGTTG GTTCTACCAA 12 0 0 

TTATATTTTC TCAGTTGTGA CTATGAATCC TGCTGAAAAT CCTGATTTTA 1250 

55 TCTTGTATGT AACGGTTCAA CAGCCTGAGC ATTATTCAGG TATCCAGTTG 13 00 

GGAGAATTTG CCACCCCAAT CTTGGAGCGG GCTTCAGCTA TGAAAGAATC 1350 

TCTCAATCTT CAATCTCCAG CCAAAAATTT AGATAAAGTT ACGACAGAAT 14 00 

CTTCTTATGC AATGCCTAGC ATCAAGGATA TTTCACCTGG TGAGTTGGCG 1450 

GAAGCCTTAC GCCGAAATAT TGTGCAACCA ATC GTTGTAG GTACTGGAAC 15 00 

60 AAAGATTAAA GAG AC TTC TG TAGAAGAAGG G AC C AATCTT GCACCAAACC 1550 

795 
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AACAAGTTCT C CTTTT ATCG GATAAGGTAG AAGAAATTCC AGACATGTAT 1600 
GGCTGGAAAA AAGAGACTGC TGAAACCTTT GCTAAATGGT TGGATATTGA 1650 
GTTGGAATTT GAAGGTTCAG GTTCCGTCGT TC 1682 



2 ) INFORMATION FOR SEQ ID NO: 1049 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 1241 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R690 



20 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1049 



GCAGCGTTGC GTGATACCGT TGAAAAAACC ATTAAAAACT GTTTGGATTT 50 

TGAAAGGAGA CAGGAGCATG AATAGAATAA AAGTTGCAAT ACTGTTTGGG 100 

25 GGTTGCTCAG AGGAGCATGA CGTATCGGTA AAATCTGCAA TAGAGATAGC 150 

CGCTAACATT AATAAAGAAA AATACGAGCC GTTATACATT GGAATTACGA 2 00 

AATCTGGTGT ATGGAAAATG TGCGAAAAAC CTTGCGCGGA ATGGGAAAAC 250 

GACAATTGCT ATTCAGCTGT ACTCTCGCCG GATAAAAAAA TGCACGGATT 3 00 

AC TTGTTAAA AAGAACCATG AATATGAAAT CAACCATGTT GATGTAGCAT 350 

3 0 TTTCAGCTTT GCATGGCAAG TCAGGTGAAG ATGG ATC CAT AC AAGGTC TG 400 

TTTGAAT-TGT CCGGTATCCC TTTTGTAGGC TGCGATATTC AAAGCTCAGC 450 

AATTTGTATG GACAAATCGT TGACATACAT CGTTGCGAAA AATGCTGGGA 500 

TAGC TACTCC CGCCTTTTGG GTTATTAATA AAGATGATAG GCCGGTGGCA 550 

GCTACGTTTA CCTATCCTGT TTTTGTTAAG CCGGCGCGTT CAGGCTCATC 600 

3 5 CTTCGGTGTG AAAAAAGTCA ATAGCGCGGA CGAATTGGAC TACGCAATTG 650 

AATCGGCAAG ACAATATGAC AGCAAAATCT TAATTGAGCA GGCTGTTTCG 700 

GGCTGTGAGG TCGGTTGTGC GGTATTGGGA AACAGTGCCG CGTTAGCTGT 7 50 

TGGCGAGGTG GACCAAATCA GGCTGCAGTA CGGAATC TTT CGTATTCATC 8 00 

AGGAAGTCGA GCCGGAAAAA GGCTCTGAAA ACGCAGTTAT AACCGTTCCC 850 

40 GCAGACCTTT CAGCAGAGGA GCGAGGACGG ATACAGGAAA CGGCAAAAAA 900 

AATATATAAA GCGCTCGGCT GTAGAGGTCT AGCCCGTGTG GATATGTTTT 950 

TACAAGATAA CGGCCGCATT GTACTGAACG AAGTCAATAC TCTGCCCGGT 1000 

TTCACGTCAT ACAGTCGTTA TCCCCGTATG ATGGCCGCTG CAGGTATTGC 1050 

ACTTCCCGAA CTGATTGACC GCTTGATCGT ATTAGCGTTA AAGGGGTGAT 1100 

45 AAGCATGGAA ATAGGATTTA CTTTTTTAGA TGAAATAGTA CACGGTGTTC 1150 

GTTGGGACGC TAAATATGCC AC TTGGG AT A ATTTCACCGG AAAACCGGTT 12 00 

GACGGTTATG AAGTAAATCG CATTGTAGGG ACATACGAAT T 1241 



2 ) INFORMATION FOR SEQ ID NO : 1050 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1249 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 



(ii) MOLECULE TYPE: Genomic DNA 



796 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R691 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1050 

TCACACCGCA TACGGCCTAT TATACCGAGC AAGCGTTGCG TGATACCGTT 50 

GAAAAAACCA TTAAAAACTG TTTGGATTTT GAAAGGAGAC AGGAGCATGA 100 

ATAGAATAAA AGTTGCAATA CTGTTTGGGG GTTGCTCAGA GGAGCATGAC 150 

10 GTATCGGTAA AATCTGCAAT AGAGATAGCC GCTAACATTA ATAAAGAAAA 200 

ATACGAGCCG TTATACATTG GAATTACGAA ATCTGGTGTA TGGAAAATGT 2 50 

GCGAAAAACC TTGCGCGGAA TGGGAAAACG ACAATTGCTA TTCAGCTGTA 3 00 

CTCTCGCCGG ATAAAAAAAT GCACGGATTA CTTGTTAAAA AGAACCATGA 350 

ATATGAAATC AACCATGTTG ATGTAGCATT TTCAGCTTTG CATGGCAAGT 400 

15 CAGGTGAAGA TGGATCCATA CAAGGTCTGT TTGAATTGTC CGGTATCCCT 450 

TTTGTAGGCT GCGATATTCA AAGCTCAGCA ATTTGTATGG ACAAATCGTT 500 

GACATACATC GTTGCGAAAA ATGCTGGGAT AGCTACTCCC GCCTTTTGGG 550 

TTATTAATAA AGATGATAGG CCGGTGGCAG CTACGTTTAC CTATCCTGTT 600 

TTTGTTAAGC CGGCGCGTTC AGGCTCATCC TTC GGTGTG A AAAAAGTCAA 650 

20 TAGCGCGGAC GAATTGGACT ACGCAATTGA ATCGGCAAGA CAATATGACA 700 

GCAAAATCTT AATTGAGCAG GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG 750 

GTATTGGGAA ACAGTGCCGC GTTAGCTGTT GGCGAGGTGG ACCAAATCAG 800 

GCTGCAGTAC GGAATCTTTC GTATTCATCA GGAAGTCGAG CCGGAAAAAG 850 

GCTCTGAAAA CGCAGTTATA ACCGTTCCCG CAGACCTTTC AGCAGAGGAG 900 

2 5 CGAGGACGGA TACAGGAAAC GGCAAAAAAA ATATATAAAG CGCTCGGCTG 950 

TAGAGGTCTA GCCCGTGTGG ATATGTTTTT ACAAGATAAC GGCCGCATTG 1000 

TACTGAACGA AGTCAATACT CTGCCCGGTT TCACGTCATA CAGTCGTTAT 1050 

CCCCGTATGA TGGCCGCTGC AGGTATTGCA CTTCCCGAAC TGATTGACCG 1100 

CTTGATCGTA TTAGCGTTAA AGGGGTGATA AGCATGGAAA TAGGATTTAC 1150 

3 0 TTTTTTAGAT GAAATAGTAC ACGGTGTTCG TTGGGACGCT AAATATGC C A 12 00 

CTTGGGATAA TTTC AC CGG A AAACCGGTTG ACGGTTATGA AGTAAATCG 1249 



35 2 ) INFORMATION FOR SEQ ID NO : 1051 



40 



45 



50 



55 



60 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1272 bases 
<B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R481 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1051 



TATACCGAGC AAGCGTTGCG TGATACCGTT 
TTTGGATTTT GAAAGGAGAC AGGAGCATGA 
CTGTTTGGGG GTTGCTCAGA GGAGCATGAC 
AGAGATAGCC GCTAACATTA ATAAAGAAAA 
GAATTACGAA ATCTGGTGTA TGGAAAATGT 
TGGGAAAACG ACAATTGCTA TTCAGCTGTA 
GCACGGATTA CTTGTTAAAA AGAACCATGA 
ATGTAGCATT TTCAGCTTTG CATGGCAAGT 
CAAGGTCTGT TTGAATTGTC CGGTATCCCT 
AAGCTCAGCA ATTTGTATGG ACAAATCGTT 



GAAAAAACCA TTAAAAACTG 50 

ATAGAATAAA AGTTGCAATA 100 

GTATCGGTAA AATCTGCAAT 150 

ATACGAGCCG TTATACATTG 2 00 

GCGAAAAACC TTGCGCGGAA 2 50 

CTCTCGCCGG ATAAAAAAAT 3 00 

ATATGAAATC AACCATGTTG 350 

CAGGTGAAGA TGGATCCATA 400 

TTTGTAGGCT GCGATATTCA 450 

GACATACATC GTTGCGAAAA 500 



797 
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ATGCTGGGAT AGCTACTCCC GCCTTTTGGG TTATTAATAA AGATGATAGG 550 

CCGGTGGCAG CTACGTTTAC CTATCCTGTT TTTGTTAAGC CGGCGCGTTC 600 

AGGCTCATCC TTCGGTGTGA AAAAAGTCAA TAGCGCGGAC GAATTGGACT 650 

ACGCAATTGA ATCGGCAAGA CAATATGACA GCAAAATCTT AATTGAGCAG 7 00 

5 GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG GTATTGGGAA ACAGTGCCGC 75 0 

GTTAGTTGTT GGCGAGGTGG ACCAAATCAG GCTGCAGTAC GGAATCTTTC 800 

GTATTCATCA GGAAGTCGAG CCGGAAAAAG GCTCTGAAAA CGCAGTTATA 850 

ACCGTTCCCG C AG AC CTTTC AGCAGAGGAG CGAGGACGGA TACAGGAAAC 900 

GGCAAAAAAA ATATATAAAG CGCTCGGCTG TAGAGGTCTA GCCCGTGTGG 950 

10 ATATGTTTTT ACAAGATAAC GGCCGCATTG TACTGAACGA AGTCAATACT 1000 

CTGCC CGGTT TCACGTCATA CAGTCGTTAT CCCCGTATGA TGGCCGCTGC 1050 

AGGTATTGCA CTTCCCGAAC TGATTGACCG CTTGATCGTA TTAGCGTTAA 1100 

AGGGGTGATA AGCATGGAAA TAGGATTTAC TTTTTTAGAT GAAATAGTAC 115 0 

ACGGTGTTCG TTGGGACGCT AAATATGCCA CTTGGGATAA TTTCACCGGA 12 00 

15 AAACCGGTTG ACGGTTATGA AGTAAATCGC ATTGTAGGGA CATACGAATT 125 0 

GGCTTGAATC GC TTTTTG AA GG 127 2 



20 2 ) INFORMATION FOR SEQ ID NO: 1052 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1237 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R492 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1052 



TCCCCCGGCA T ACGGCC TAT TATACCGAGC AAGCGTTGCG TGATACCGTT 50 

GAAAAAACCA TTAAAAACTG TTTGGATTTT GAAAGGAGAC AGGAGCATGA 100 

ATAGAATAAA AGTTGCAATA CTGTTTGGGG GTTGC TC AG A GGAGCATGAC 150 

GTATCGGTAA AATCTGCAAT AGAGATAGCC GCTAACATTA ATAAAGAAAA 200 

40 ATACGAGCCG TTATACATTG GAATTACGAA ATCTGGTGTA TGGAAAATGT 2 50 

GCGAAAAACC TTGCGCGGAA TGGGAAAACG AC AATTGC TA TTCAGCTGTA 3 00 

CTCTCGCCGG ATAAAAAAAT GCACGGATTA CTTGTTAAAA AGAACCATGA 3 50 

ATATGAAATC AACCATGTTG ATGTAGCATT TTCAGCTTTG CATGGCAAGT 400 

CAGGTGAAGA TGGATCCATA CAAGGTCTGT TTGAATTGTC CGGTATCCCT 450 

45 TTTGTAGGCT GCGATATTCA AAGCTCAGCA ATTTGTATGG ACAAATCGTT 500 

GACATACATC GTTGC G AAAA ATGCTGGGAT AGCTACTCCC GCCTTTTGGG 55 0 

TTATTAATAA AGATGATAGG CCGGTGGCAG CTACGTTTAC CTATCCTGTT 600 

TTTGTTAAGC CGGCGCGTTC AGGCTCATCC TTCGGTGTGA AAAAAGTCAA 650 

TAGCGCGGAC GAATTGGACT ACGCAATTGA ATCGGCAAGA CAATATGACA 7 00 

50 GCAAAATCTT AATTGAGCAG GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG 75 0 

GTATTGGGAA ACAGTGCCGC GTTAGCTGTT GGCGAGGTGG ACCAAATCAG 800 

GCTGCAGTAC GGAATCTTTC GTATTCATCA GGAAGTCGAG CCGGAAAAAG 850 

GCTCTGAAAA CGCAGTTATA ACCGTTCCCG C AG AC CTTTC AGCAGAGGAG 900 

CGAGGACGGA TACAGGAAAC GGCAAAAAAA ATATATAAAG CGCTCGGCTG 950 

55 TAGAGGTCTA GCCCGTGTGG ATATGTTTTT ACAAGATAAC GGCCGCATTG 1000 

TACTGAACGA AGTCAATACT CTGCCCGGTT TCACGTCATA CAGTCGTTAT 105 0 

CCCCGTATGA TGGCCGCTGC AGGTATTGCA CTTCCCGAAC TGATTGACCG 1100 

CTTGATCGTA TTAGCGTTAA AGGGGTGATA AGCATGGAAA TAGGATTTAC 1150 

TTTTTTAGAT GAAATAGTAC ACGGTGTTCG TTGGGACGCT AAATATGCCA 12 00 

60 CTTGGGATAA TTTCACCGGA AAACCGGTTG ACGGTTA 1237 

798 
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2 ) INFORMATION FOR SEQ ID NO: 1053 



5 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1263 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Enterococcus faeclum 

(B) STRAIN: R581 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1053 

2 0 CATACGGCCT ATT AT AC CG A GCAAGCGTTG CGTGATACCG TTGAAAAAAC 50 

CATTAAAAAC TGTTTGGATT TTGAAAGGAG ACAGGAGCAT GAATAGAATA 100 

AAAGTTGCAA T AC TGTTTGG GGGTTGCTCA GAGGAGCATG ACGTATCGGT 150 

AAAATCTGCA ATAGAGATAG CCGCTAACAT TAATAAAGAA AAATACGAGC 2 00 

CGTTATACAT TGGAATTACG AAATCTGGTG TATGGAAAAT GTGCGAAAAA 2 50 

25 CCTTGCGCGG AATGGGAAAA CGACAATTGC TATTCAGCTG TACTCTCGCC 3 00 

GGATAAAAAA ATGCACGGAT TACTTGTTAA AAAGAACCAT GAATATGAAA 3 50 

TCAACCATGT TGATGTAGCA TTTTCAGCTT TGCATGGCAA GTCAGGTGAA 400 

GATGGATCCA TACAAGGTCT GTTTGAATTG TCCGGTATCC CTTTTGTAGG 450 

CTGCGATATT CAAAGCTCAG CAATTTGTAT GGACAAATCG TTGACATACA 500 

3 0 TCGTT GCGAA AAATGC TGGG ATAGCTACTC CCGCCTTTTG GGTTATTAAT 55 0 

AAAGATGATA GGCCGGTGGC AGCTACGTTT ACCTATCCTG TTTTTGTTAA 600 
GCCGGCGCGT TCAGGCTCAT CCTTCGGTGT GAAAAAAGTC AATAGCGCGG 650 

ACGAATTGGA CTACGCAATT GAATCGGCAA GACAATATGA CAGCAAAATC 7 00 

TTAATTGAGC AGGC TGTTTC GGGCTGTGAG GTCGGTTGTG CGGTATTGGG 7 50 

3 5 AAACAGTGCC GCGTTAGCTG TTGGCGAGGT GGACCAAATC AGGCTGCAGT 800 
ACGGAATCTT TCGTATTCAT CAGGAAGTCG AGCCGGAAAA AGGCTCTGAA 850 
AACGCAGTTA TAACCGTTCC CGCAGACCTT TCAGCAGAGG AGCGAGGACG 900 
GATACAGGAA ACGGCAAAAA AAATATATAA AGCGCTCGGC TGTAGAGGTC 950 

TAGCCCGTGT GGATATGTTT TTACAAGATA ACGGCCGCAT TGTACTGAAC 1000 

40 GAAGTCAATA CTCTGCCCGG TTTCACGTCA TACAGTCGTT ATCCCCGTAT 1050 

GATGGCCGCT GCAGGTATTG CACTTCCCGA ACTGATTGAC CGCTTGATCG 1100 

TATTAGCGTT AAAGGGGTGA TAAGCATGGA AATAGGATTT ACTTTTTTAG 1150 

ATGAAATAGT ACACGGTGTT CGTTGGGACG C TAAATATGC C AC TTGGG AT 12 00 

AATTTC AC CG GAAAACCGGT TGAC GGGTAT GAAAGTAAAT CGCATTGTAG 1250 

45 GGACATTCGA ATT 12 63 



50 



2 ) INFORMATION FOR SEQ ID NO : 1054 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1232 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Enterococcus faecalis 

799 
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(B) STRAIN: R610 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1054 

* CGGCCTATTA TNCCGAGCAA GCGTTGCGTG ATACCGTTGA AAAAACCATT 50 

AAAAACTGTT TGGATTTTGA AAGGAGACAG GAGCATGAAT AGAATAAAAG 100 

TTGCAATACT GTTTGGGGGT TGCTCAGAGG AGCATGACGT ATCGGTAAAA 150 

TCTGCAATAG AGATAGCCGC TAACATTAAT AAAGAAAAAT ACGAGCCGTT 200 

ATACATTGGA ATTACGAAAT CTGGTGTATG GAAAATGTGC GAAAAACCTT 250 

10 GCGCGGAATG GGAAAACGAC AATTGC TATT CAGCTGTACT CTCGCCGGAT 300 

AAAAAAATGC ACGGATTACT TGTTAAAAAG AACCATGAAT ATGAAATCAA 350 

CCATGTTGAT GTAGCATTTT CAGCTTTGCA TGGCAAGTCA GGTGAAGATG 400 

GATCCATACA AGGTCTGTTT GAATTGTCCG GTATCCCTTT TGTAGGCTGC 450 

GATATTCAAA GCTCAGCAAT TTGTATGGAC AAATCGTTGA CATACATCGT 500 

15 TGCGAAAAAT GCTGGGATAG CTACTCCCGC CTTTTGGGTT ATTAATAAAG 550 

ATGATAGGCC GGTGGCAGCT ACGTTTACCT ATCCTGTTTT TGTTAAGCCG 600 

GCGCGTTCAG GCTCATCCTT CGGTGTGAAA AAAGTCAATA GCGCGGACGA 650 

ATTGGACTAC GCAATTGAAT CGGC AAGAC A AT ATG AC AG C AAAATC TTAA 7 00 

TTGAGCAGGC TGTTTCGGGC TGTGAGGTCG GTTGTGCGGT ATTGGGAAAC 7 50 

20 AGTGCCGCGT TAGTTGTTGG CGAGGTGGAC CAAATCAGGC TGCAGTACGG 800 

AATCTTTCGT ATTCATCAGG AAGTCGAGCC GGAAAAAGGC TCTGAAAACG 850 

CAGTTATAAC CGTTCCCGCA GACCTTTCAG CAGAGGAGCG AGGACGGATA 900 

CAGGAAACGG CAAAAAAAAT ATATAAAGCG CTCGGCTGTA GAGGTCTAGC 950 

CCGTGTGGAT ATGTTTTTAC AAGATAACGG CCGCATTGTA CTGAACGAAG 1000 

25 TCAATACTCT GCCCGGTTTC ACGTCATACA GTCGTTATCC CCGTATGATG 1050 

GCCGCTGCAG GTATTGCACT TCCCGAACTG ATTGAC CGC T TGATCGTATT 1100 

AGCGTTAAAG GGGTGATAAG CATGGAAATA GGATTTACTT TTTTAGATGA 1150 

AATAGTACAC GGTGTTCGTT GGGACGCTAA ATATGCCACT TGGGATAATT 12 00 

TC AC CGGAAA ACCGGTTGAC GGTTATAAGT AA 1232 

30 



2 ) INFORMATION FOR SEQ ID NO: 1055 



35 



40 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1218 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R684 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1055 



TACCGAGCAA 
5 0 TGGATTTTGA 
GTTTGGGGGT 
AGATAGCCGC 
ATTACGAAAT 
GGAAAACGAC 
55 ACGGATTACT 
GTAGCATTTT 
AGGTCTGTTT 
GCTCAGCAAT 
GCTGGGATAG 
60 GGTGGCAGCT 



GCGTTGCGTG 
AAGGAGACAG 
TGCTCAGAGG 
TAACATTAAT 
CTGGTGTATG 
AATTGC TATT 
TGTTAAAAAG 
CAGCTTTGCA 
GAATTGTCCG 
TTGTATGGAC 
CTACTCCCGC 
ACGTTTACCT 



ATACCGTTGA 
GAGCATGAAT 
AGCATGACGT 
AAAGAAAAAT 
GAAAATGTGC 
CAGCTGTACT 
AACCATGAAT 
TGGCAAGTCA 
GTATCCCTTT 
AAATCGTTGA 
CTTTTGGGTT 
ATCCTGTTTT 



AAAAACCATT 
AGAATAAAAG 
ATCGGTAAAA 
ACGAGCCGTT 
GAAAAACCTT 
CTCGCCGGAT 
ATGAAATCAA 
GGTGAAGATG 
TGTAGGCTGC 
CATACATCGT 
ATTAATAAAG 
TGTTAAGCCG 



AAAAACTGTT 


50 


TTGCAATACT 


100 


TCTGCAATAG 


150 


ATACATTGGA 


200 


GCGCGGAATG 


250 


AAAAAAATGC 


300 


CCATGTTGAT 


350 


GATCCATACA 


400 


GATATTCAAA 


450 


TGCGAAAAAT 


500 


ATGATAGGCC 


550 


GCGCGTTCAG 


600 



800 
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GCTCATCCTT CGGTGTGAAA AAAGTCAATA GCGCGGACGA ATTGGAC T AC 650 

GCAATTGAAT CGGCAAGACA AT ATG AC AG C AAAATC TTAA TTGAGCAGGC 7 00 

TGTTTCGGGC TGTGAGGTCG GTTGTGCGGT ATTGGGAAAC AGTGCCGCGT 7 50 

TAGCTGTTGG CGAGGTGGAC CAAATCAGGC TGCAGTACGG AATCTTTCGT 800 

5 ATTCATCAGG AAGTCGAGCC GGAAAAAGGC TCTGAAAACG CAGTTATAAC 85 0 

CGTTCCCGCA GACCTTTCAG CAGAGGAGCG AGGACGGATA CAGGAAACGG 900 

CAAAAAAAAT ATATAAAGCG CTCGGCTGTA GAGGTCTAGC CCGTGTGGAT 950 

ATGTTTTTAC AAGATAACGG CCGCATTGTA CTGAACGAAG TCAATACTCT 1000 

GCCCGGTTTC ACGTCATACA GTCGTTATCC CCGTATGATG GCCGCTGCAG 1050 

10 GTATTGCACT TCCCGAACTG ATTGACCGCT TGATCGTATT AGCGTTAAAG 1100 

GGGTGATAAG CATGGAAATA GGATTTACTT TTTTAGATGA AATAGTACAC 115 0 

GGTGTTCGTT GGGACGCTAA ATATGCCACT TGGGATAATT TCACCGGAAA 1200 

AC CGGTTG AC GGTTAGAA 1218 



15 



30 



60 



2 ) INFORMATION FOR SEQ ID NO: 1056 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1265 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R688 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1056 



AATCACACCG CATACGGCCT ATTATAC C G A GCAAGCGTTG CGTGATACCG 50 

TTGAAAAAAC CATTAAAAAC TGTTTGGATT TTGAAAGGAG ACAGGAGCAT 100 

35 GAATAGAATA AAAGTTGCAA TACTGTTTGG GGGTTGCTCA GAGGAGCATG 150 

ACGTATCGGT AAAATCTGCA ATAGAGATAG CCGCTAACAT TAATAAAGAA 20 0 

AAATACGAGC CGTTATACAT TGGAATTACG AAATCTGGTG TATGGAAAAT 25 0 

GTGCGAAAAA CCTTGCGCGG AATGGGAAAA CGACAATTGC TATTCAGCTG 3 00 

TACTCTCGCC GGATAAAAAA ATGCACGGAT TACTTGTTAA AAAGAACCAT 3 50 

40 GAATATGAAA TCAACCATGT TGATGTAGCA TTTTCAGCTT TGCATGGCAA 400 

GTCAGGTGAA GATGGATCCA TACAAGGTCT GTTTGAATTG TCCGGTATCC 450 

CTTTTGTAGG CTGCGATATT CAAAGCTCAG CAATTTGTAT GGACAAATCG 500 

TTGACATACA TCGTTGCGAA AAATGCTGGG ATAGCTACTC CCGCCTTTTG 550 

GGTTATTAAT AAAGATGATA GGCCGGTGGC AGCTACGTTT ACCTATCCTG 600 

45 TTTTTGTTAA GCCGGCGCGT TCAGGCTCAT CCTTCGGTGT GAAAAAAGTC 650 

AATAGCGCGG ACGAATTGGA CTACGCAATT GAATCGGCAA GACAATATGA 7 00 

CAGCAAAATC TTAATTGAGC AGGCTGTTTC GGGCTGTGAG GTCGGTTGTG 750 

CGGTATTGGG AAACAGTGCC GCGTTAGCTG TTGGCGAGGT GGACCAAATC 800 

AGGCTGCAGT ACGGAATCTT TCGTATTCAT CAGGAAGTCG AGCCGGAAAA 850 

50 AGGCTCTGAA AACGCAGTTA TAACCGTTCC CGCAGACCTT TCAGCAGAGG 900 

AGCGAGGACG GATACAGGAA AC GGC AAAAA AAATATATAA AGCGCTCGGC 950 

TGTAGAGGTC TAGCCCGTGT GGATATGTTT TTACAAGATA ACGGCCGCAT 1000 

TGTACTGAAC GAAGTCAATA CTCTGCCCGG TTTC AC GTC A TACAGTCGTT 1050 

ATCCCCGTAT GATGGCCGCT GCAGGTATTG CACTTCCCGA AC TG ATTG AC 1100 

55 CGCTTGATCG TATTAGC GTT AAAGGGGTGA TAAGCATGGA AATAGGATTT 1150 

ACTTTTTTAG ATGAAATAGT ACACGGTGTT CGTTGGGACG CTAAATATGC 1200 

CACTTGGGAT AATTTCACCG GAAAACCGGT TGACGGTTAT GAAGTAAATC 1250 

GCATTGTAGG GACAT 1265 



801 
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2 ) INFORMATION FOR SEQ ID NO: 1057 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 1269 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 

(B) STRAIN: R689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1057 

TCACACCGCA TACGGCCTAT TATACCGAGC AAGCGTTGCG TG AT AC CGTT 50 

GAAAAAACCA TTAAAAACTG TTTGGATTTT GAAAGGAGAC AGGAGCATGA 100 

20 ATAGAATAAA AGTTGCAATA CTGTTTGGGG GTTGCTCAGA GGAGCATGAC 150 

" W GTATCGGTAA AATCTGCAAT AGAGATAGCC GCTAACATTA ATAAAGAAAA 200 

ATACGAGCCG TTATACATTG GAATTACGAA ATCTGGTGTA TGGAAAATGT 250 

GCGAAAAACC TTGCGCGGAA TGGGAAAACG ACAATTGCTA TTCAGCTGTA 300 

CTCTCGCCGG ATAAAAAAAT GCACGGATTA CTTGTTAAAA AGAACCATGA 350 

25 ATATGAAATC AACCATGTTG ATGTAGCATT TTCAGCTTTG CATGGCAAGT 400 

CAGGTGAAGA TGGATCCATA CAAGGTCTGT TTGAATTGTC CGGTATCCCT 450 

TTTGTAGGCT GCGATATTCA AAGCTCAGCA ATTTGTATGG ACAAATCGTT 500 

GACATACATC GTTGCGAAAA ATGCTGGGAT AGCTACTCCC GCCTTTTGGG 550 

TTATTAATAA AGATGATAGG CCGGTGGCAG CTACGTTTAC CTATCCTGTT 600 

30 TTTGTTAAGC CGGCGCGTTC AGGCTCATCC TTCGGTGTGA AAAAAGTCAA 65 0 

TAGCGCGGAC GAATTGGACT ACGCAATTGA ATCGGCAAGA CAATATGACA 700 

GCAAAATCTT AATTGAGCAG GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG 750 

GTATTGGGAA ACAGTGCCGC GTTAGCTGTT GGCGAGGTGG ACCAAATCAG 800 

GCTGCAGTAC GGAATCTTTC GTATTCATCA GG AAGTC GAG CCGGAAAAAG 850 

3 5 GCTCTGAAAA CGCAGTTATA ACCGTTCCCG CAGACCTTTC AGCAGAGGAG 900 

CGAGGACGGA TACAGGAAAC GGCAAAAAAA ATATATAAAG CGCTCGGCTG 950 

TAGAGGTCTA GCCCGTGTGG ATATGTTTTT ACAAGATAAC GGCCGCATTG 1000 

TACTGAACGA AGTCAATACT CTGCCCGGTT TCACGTCATA CAGTCGTTAT 1050 

CCCCGTATGA TGGCCGCTGC AGGTATTGCA CTTCCCGAAC TGATTGACCG 1100 

40 CTTGATCGTA TTAGCGTTAA AGGGGTGATA AGCATGGAAA TAGGATTTAC 1150 

TTTTTTAGAT GAAATAGTAC ACGGTGTTCG TTGGGACGCT AAATATGCCA 12 00 

CTTGGGATAA TTTCACCGGA AAACCGGTTG AC GGTTATGA AGTAAATCGC 12 50 

ATTGTAGGGA CATACGAAT 12 69 



45 



2 ) INFORMATION FOR SEQ ID NO: 1058 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 1169 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R42 0 

60 

802 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1058 

CAAATTTTCT TTTCTTTTCC TAGGTACACT GAATGTAACC TTAAAAGAAA 50 

AAAGGAAAGG AAGAAAATGA TGAAAAAAAT TGCCGTTTTA TTTGGAGGGA 100 

5 ATTCTCCAGA ATACTCAGTG TC AC TAACC T CAGCAGCAAG TGTGATCCAA 150 

GCTATTAACC CGCTGAAATA TGAAGTAATG ACCATTGGCA TCGCACCAAC 2 00 

AATGGATTGG TATTGGTATC AAGGAAACCT CGCGAATGTT CGCAATGATA 2 50 

CTTGGCTAGA AGATCACAAA AACTGTC AC C AGCTGACTTT TTCTAGCCAA 3 00 

GGATTTATAT TAGGAGAAAA ACGAATCGTC CCTGATGTCC TCTTTCCAGT 350 

10 CTTGCATGGG AAGTATGGCG AGGATGGCTG TATCCAAGGA CTGCTTGAAC 400 

TAATGAACCT GCCTTATGTT GGTTGCCATG TCGCTGCCTC CGCATTATGT 45 0 

ATGAACAAAT GGC TCTTGC A TCAACTTGCT GATACCATGG GAATCGCTAG 500 

TGCTCCCACT TTGC TTTT AT CCCGCTATGA AAACGATCCT GCCACAATCG 550 

ATCGTTTTAT TC AAG AC CAT GGATTCCCGA TCTTTATCAA GCCGAATGAA 600 

15 GCCGGTTCTT CAAAAGGGAT CACAAAAGTA ACTGAC AAAA CAGCGCTCCA 650 

ATCTGCATTA ACGACTGCTT TTGCTTACGG TTCTACTGTG TTGATCCAAA 700 

AGGCG AT AG C GGGTATTGAA ATTGGCTGCG GCATCTTAGG AAATGAGCAA 7 50 

TTGACGATTG GTGC TTGTG A TGCGATTTCT CTTGTCGACG GTTTTTTTGA 800 

TTTTGAAGAG AAATACCAAT TAATCAGCGC CACGATCACT GTCCCAGCAC 850 

20 CATTGCCTCT CGCGCTTGAA TCACAGATCA AGGAGCAGGC AC AGC TGCTT 900 

TATCGAAACT TGGGATTGAC GGGTCTGGCT CGAATCGATT TTTTCGTCAC 950 

CAATCAAGGA GCGATTTATT TAAACGAAAT CAACACCATG CCGGGATTTA 1000 

CTGGGCACTC CCGCTACCCA GCTATGATGG CGGAAGTCGG GTTATCCTAC 1050 

GAAATATTAG TAGAGCAATT GATTGCACTG GCAGAGGAGG ACAAACGATG 1100 

2 5 AACACATTAC AATTGATCAA TAAAAACCAT CCATTGAAAA AAAATCAAGA 1150 

GCCCCCGCAC TTAGTGCTA 1169 



30 2 ) INFORMATION FOR SEQ ID NO: 1059 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1166 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

{ D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R631 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1059 



CAAATTTTCT TTTCTTTTCC TAGGTACACT GAATGTAACC TTAAAAGAAA 50 

AAAGGAAAGG AAGAAAATGA TGAAAAAAAT TGCCGTTTTA TTTGGAGGGA 100 

ATTCTCCAGA ATACTCAGTG TCACTAGCCT CAGCAGCAAG TGTGATCCAA 150 

GCTATTGACC CGCTGAAATA TGAAGTAATG ACCATTGGCA TCGCACCAAC 200 

50 AATGGATTGG TATTGGTATC AAGGAAACCT CGCGAATGTT CGCAATGATA 250 

CTTGGCTAGA AGATCACAAA AACTGTCACC AGCTGACTTT TTCTAGCCAA 300 

GGATTTATAT TAGGAGAAAA ACGAATCGTC CCTGATGTCC TCTTTCCAGT 350 

CTTGCATGGG AAGTATGGCG AGGATGGCTG TATCCAAGGA CTGCTTGAAC 400 

TAATGAACCT GCCTTATGTT GGTTGCCATG TCGCTGCCTC CGCATTATGT 450 

55 ATGAACAAAT GGCTCTTGCA TCAACTTGCT GATACCATGG GAATCGCTAG 500 

TGCTCCCACT TTGCTTTTAT CCCGCTATGA AAACGATCCT GCCACAATCG 550 

ATCGTTTTAT TCAAGACCAT GGATTCCCGA TCTTTATCAA GCCGAATGAA 600 

GCCGGTTCTT CAAAAGGGAT CACAAAAGTA ACTGACAAAA CAGCGCTCCA 650 

ATCTGCATTA ACGACTGCTT TTGCTTACGG TTCTACTGTG TTGATCCAAA 7 00 

60 AGGCGATAGC GGGTATTGAA ATTGGCTGCG GCATCTTAGG AAATGAGCAA 7 50 
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TTGACGATTG GTGCTTGTGA TGCGATTTCT 
TTTTGAAGAG AAATACCAAT TAATCAGCGC 
CATTGCCTCT CGCGCTTGAA TCACAGATCA 
TATCGAAACT TGGGATTGAC GGGTCTGGCT 
5 CAATCAAGGA GCGATTTATT TAAACGAAAT 
CTGGGCACTC CCGCTACCCA GCTATGATGG 
GAAATATTAG TAGAGCAATT GATTGCACTG 
AACACATTAC AATTGATCAA T AAAAAC CAT 
GCCCCCGCAC TTAGTG 



CTTGTCGACG GTTTTTTTGA 800 

CACGATCACT GTCCCAGCAC 850 

AGGAGCAGGC ACAGCTGCTT 900 

CGAATCGATT TTTTCGTCAC 950 

CAACACCATG CCGGGATTTA 1000 

CGGAAGTCGG GTTATCCTAC 1050 

GCAGAGGAGG ACAAACGATG 1100 

CCATTGAAAA AAAATCAAGA 1150 

1166 



2 ) INFORMATION FOR SEQ ID NO: 1060 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1028 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseli flaws 
25 (B) STRAIN: ATCC 25788 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1060 

AACATGAAAA AAATCGCCTT ATTTTTGGAG GCAATTCACC GGAATACACC 50 

3 0 GTTTCTTTAG CTTCAGCAAC TAGCGCAATC GAAGCACTCC AATCATCTCC 100 

CTATGACTAC GACCTCTCTT TGATCGGGAT CGCCCCAGAT GCTATGGATT 150 

GGT AC TTGT A TACAGGAGAA CTGGAAAACA TCCGACAAGA CACGTGGTTG 200 

TTGGATACGA AACATAAACA GAAAATACAG CCGCTATTCG AAGGAAACGG 250 

CTTTTGGCTA AGTGAAGAGC AGCAAACGTT GGTACCTGAT GTTTTATTTC 3 00 

3 5 CCATTATGCA TGGCAAATAC GGGGAAGATG GCAGTATCCA AGGATTGTTT 3 50 

GAATTGATGA AGCTGCCTTA TGTAGGC TGC GGGGTGGCAG GTTCTGCCTT 400 

ATGTATGAAC AAATGGCTGC TGCATCAAGC TGCAGCAGCC ATTGGCGTAC 45 0 

AAAGTGCTCC TACGATTCTC TTGACAAATC AAGCCAACCA GCAAGAACAA 500 

ATCGAAGCTT TTATCCAGAC CCATGGCTTC CCAGTTTTCT TTAAGCCTAA 55 0 

40 TGAAGCGGGC TCCTCAAAAG GGATCACTAA AGTCACCTGC GTTGAAGAAA 600 

TCGCTTCTGC CTTAAAAGAA GCCTTTACTT ATTGTTCCGC AGTGCTCCTA 650 

CAAAAAAATA TTGCCGGTGT TGAGATCGGT TGCGGTATTT TGGGCAACGA 7 00 

CTC TTTGACT GTC GGTGCTT GTGACGCCAT TTCATTAGTA GACGGCTTTT 750 

TCGATTTTGA AGAAAAGTAC CAGCTGATCA GCGCCAAAAT CACCGTCCCT 800 

45 GCGCCATTGC CTGAAACGAT TGAAACCAAG GTCAAAGAAC AAGCTCAGCT 850 

GCTCTATCGT AGTC TTGGTC TTAAAGGTCT TGCTCGCATC G AC TTTTTTG 900 

TCACGGAGCG AGGAGAACTA TACTTGAATG AAATCAATAC TATGCCGGGC 950 

TTTACGAGTC ACTCCCGCTA TCCTGCCATG ATGGCAGCGG TCGGCTTATC 1000 

CTATCAAGAA CTACTACAAA AACTGCTT 1028 

50 



2 ) INFORMATION FOR SEQ ID NO : 1061 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1030 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Exit erococcus casseli flaws 
5 (B) STRAIN: R689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1061 

AATATGAAAA AAATCGCCTT ATTTTTGGAG GCAATTCACC GGAGTACGCC 50 

10 GTTTCTTTAG CCTCAGCAAC TAGCGCAATC GAAGCACTCC AATCATCTCC 100 

CGATGACTAT GACCTCTCTT TGATCGGGAT CGCCCCAGAT GCTATGGATT 150 

GGTATTTGTA TACAGGAGAA CTGGAAAACA TCCGACAAGA CACGTGGTTG 2 00 

TTGGATACGA AACATAAACA G AAAATC C AG CCGCTTTTTG AAGGAAACGG 250 

CTTTTGGCTA AGTGAAGAGC AACAAACGTT GGTTCCTGAT GTTTTATTTC 300 

15 CCATTATGCA TGGCAAATAC GGGGAAGATG GCAGTATCCA AGGATTGTTT 3 50 

GAATTGATGA AACTACCTTA TGTAGGCTGC GGGGTGGCAG CCTCTGCCTT 400 

ATGTATGAAC AAATGGCTGC TGCATCAAGC AGCAGAAGCG ATTGGCGTAC 450 

AAAGTGCTCC TACGATTCTC TTGACAAATC AAGCCAACCA GCAAGATCAA 500 

. ATCGAAGCTT TTATCCAGAC CCATGGCTTT CCGGTTTTTT TTAAGCCTAA 550 

2 0 TGAAGCGGGC TCCTCAAAAG GGATCACTAA AGTCACCTGC GTTGAAGAAA 600 

TCGCTTCTGC CTTAAAAGAA GCCTTTACTT ATTGTTCAGC AGTGCTCCTA 650 

CAAAAAAATA TTGCCGGTGT TGAGATCGGT TGCGGTATTT TGGGCAACGA 7 00 

CTCTTTGACT GTCGGTGCTT GTGACGCTAT TTCATTAGTA GACGGCTTTT 750 

TCGATTTTGA AGAAAAGTAC CAGCTGATCA GCGCCAAGAT CACCGTTCCT 8 00 

25 GCACCATTGC CTGAAACGAT TGAAACCAAG GTCAAAGAAC AAGCTCAGCT 85 0 

GCTCTATCGT AGTCTTTGTC TTAAAGGTCT TGCTCGCATC GACTTTTTTG 900 

TCACGGATCA AGGAGAACTA TACTTGAATG AAATCAATAC TATGCCGGGC 95 0 

TTTACGAGTC ACTCCCGCTA TCCTGCCATG ATGGCAGCGA TCGGCTTATC 10 00 

CTATCAAGAA C T AC TAC AAA AACTGCTTGT 1030 

30 



2 ) INFORMATION FOR SEQ ID NO: 1062 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1031 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseli flavus 
45 (B) STRAIN: R754 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1062 

AAACATGAAA AAAATCGCCA TTATTTTTGG AGGCAATTCA CCGGAATACA 50 

50 CCGTTTCTTT AGCTTCAGCA ACTAGCGCAA TCGAAGCACT CCAATCATCT 100 

CCCTATGACT ACGACCTCTC TTTGATCGGG ATCGCCCCAG ATGCTATGGA 150 

TTGGTACTTG TATACAGGAG AACTGGAAAA CATCCGACAA GACACGTGGT 200 

TGTTGGATAC GAAACATAAA CAGAAAATAC AGCCGCTATT CGAAGGAAAC 2 50 

GGC TTTTGGC TAAGTGAAGA GCAGCAAACG TTGGTACCTG ATGTTTTATT 300 

55 TCCCATTATG CATGGCAAAT ACGGGGAAGA TGGCAGTATC CAAGGATTGT 350 

TTGAATTGAT GAAGCTGCCT TATGTAGGCT GCGGGGTGGC AGGTTCTGCC 400 

TTATGTATGA ACAAATGGCT GCTGCATCAA GCTGCAGCAG CCATTGGCGT 450 

ACAAAGTGCT CCTACGATTC TCTTGACAAA TCAAGCCAAC CAGCAAGAAC 500 

AAATCGAAGC TTTTATCCAG ACCCATGGCT TCCCAGTTTT CTTTAAGCCT 550 

60 AATGAAGCGG GCTCCTCAAA AGGGATCACT AAAGTCACCT GCGTTGAAGA 600 
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AATCGCTTCT GCCTTAAAAG 
TACAAAAAAA TATTGCCGGT 
G AC TCTTTG A CTGTCGGTGC 
TTTC G ATTTT GAAGAAAAGT 
5 CTGCGCCATT GCCTGAAACG 
CTGCTCTATC GTAGTCTTGG 
TGTCACGGAG CGAGGAGAAC 
GCTTTACGAG TCACTCCCGC 
TCCTATCAAG AACTACTACA 



AAGCCTTTAC TTATTGTTCC 
GTTGAGATCG GTTGCGGTAT 
TTGTGACGCC ATTTCATTAG 
ACCAGCTGAT CAGCGCCAAA 
ATTGAAACCA AGGTCAAAGA 
TCTTAAAGGT CTTGCTCGCA 
TATACTTGAA TGAAATCAAT 
TATCCTGCCA TGATGGCAGC 
AAAACTGCTT G 



GCAGTGCTCC 650 

TTTGGGCAAC 7 00 

TAGACGGCTT 7 50 

ATCACCGTCC 800 

ACAAGCTCAG 850 

TCGACTTTTT 900 

ACTATGCCGG 950 

GGTCGGCTTA 1000 
1031 



2 ) INFORMATION FOR SEQ ID NO: 1063 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 103 0 bases 

(B) TYPE: Nucleic acid 

(C) S TRANDEDNE S S : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 
25 (B) STRAIN: R775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1063 

AACATGAAAA AAATCGCCAT TATTTTTGGA GGCAATTCAC CGGAATACAC 50 

30 S55ttc5555: GCITCAGCAA CTAGCGCAAT CGAAGCACTC CAATC^CTC 100 

CCTATGACTA CGACCTCTCT TTGATCGGGA TCGCCCCAGA TGCTATGGAT 150 

?Sg?a2ttgt ATACAGGAGA ACTGGAAAAC ATCCGACAAG ACACGTGGTT 200 

GTTGGATACG AAACATAAAC AGAAAATACA GCCGCTATTT GAAGGAAACG 2 50 

g?S?tcS§? ^Stgaagag CAGCAAACGT TAGTACCTGA TATTTTATTT 300 

35 CCCATTATGC ATGGCAAATA CGGGGAAGAT GGCAGTATCC AAGGATTGTT 350 

TGAATTGATG AAACTACCTT ATGTAGGTTG CGGGGTGGCA GGTTCTGCCT 400 

TATGTATGAA CAAATGGCTG CTGCATCAAG CTGCAGCAGC CATTGGCGTA 45 0 

CAAAGTGCTC CTACGATTCT CTTGACAAAT CAAGCCAACC AGCAAGAACA 500 

AATCGAAGCT TTTATCCAGA CCCATGGCTT CCCAGTTTTC TTTAAGCCTA 550 

40 ATGAAGCGGG CTCTTCAAAA GGGATCACTA AAGTCACCTG CGTTGAAGAA 600 

ATCGCTTCTG CCTTAAAAAA AGCCTTTACT TATTGTTCCG CAGTGCTCCT 650 

ACAAAAAAAT ATTGC CGGTG TTGAGATCGG TTGCGGTATT TTGGGCAACG 700 

AC TCTTTG AC TGTCGGTGCT TGTGACGCCA TTTCATTAGT AGACGGCTTT 750 

TTCGATTTTG AAGAAAAGTA CCAGCTGATC AGCGCCAAAA TCACCGTCCC 800 

45 TGCGCCATTG CCTGAAACGA TTGAAACCAA GGTCAAAGAA CAAGCTCAGC 850 

TGCTCTATCG TAGTCTTGGT CTTAAAGGTC TTGCTCGCAT CGACTTTTTT 900 

GTCACGGATC AAGGAGAACT AT AC TTG AAT GAAATCAATA CTATGCCGGG 950 

CTTTACGAGT CACTCCCGTT ATCCTGCCAT GATGGCAGCG GTCGGCTTAT 1000 

CCTATCAAGA ACTACTACAA AAACTGCTTG 1030 

50 



2) INFORMATION FOR SEQ ID NO: 1064 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
5 (B) STRAIN: ATCC 49996 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1064 

AAACATGAAA AAAATCGCCA TTATTTTTGG AGGCAATTCA CC GGAATAC A 50 

10 CCGTTTCTTT AGCTTCAGCA ACTAGCGCAA TCGAAGCACT CCAATCATCT 100 

CCCTATGACT ACGACCTCTC TTTGATCGGG ATCGCCCCAG ATGCTATGGA 150 

TTGGTACTTG TATACAGGAG AACTGGAAAA CATCCGACAA GACACGTGGT 2 00 

TGTTGGATAC GAAACAGAAA CAGAAAATAC AGCCGCTATT CGAAGGAAAC 250 

GGCTTTTGGT TAAGTGAAGA GCAGCAAACG TTGGTACCTG ATGTTTTATT 3 00 

15 TCCCATTATG CATGGCAAAT ACGGGGAAGA TGGCAGTATC CAAGGATTGT 350 

TTGAATTGAT GAAGCTACCT TATGTAGGCT GCGGGGTGGC AGGTTCTGCC 400 

TTATGTATGA ACAAATGGTT GCTGCATCAA GCTGCAGCAG CCATTGGCGT 450 

ACAAAGTGCT CCTACGATTC TCTTGACAAA TCACGCCAAC CAGCAAGAAC 500 

AAATCGAAGC TTTTATCCAG ACCCATGGCT TTCCAGTTTT CTTTAAGCCT 5 50 

2 0 AATGAAGCGG GTTCCTCAAA AGGGATCACT AAAGTCACCT GCGTTGAAGA 600 

AATCGCTTCT GCCTTAAAAG AAGCCTTTAC TTATTGTTCC GCAGTGCTCC 65 0 

TACAAAAAAA TATTGCCGGT GTTGAGATCG GTTGCGGTAT TTTGGGCAAC 7 00 

GACTCTTTGA CTGTCGGTGC TTGTGACGCC ATTTCATTAG TAGACGGCTT 7 50 

TTTC GATTTT GAAGAAAAGT ACCAGCTGAT CAGCGCCAAA ATCACCGTCC .800 

25 CTGCGCCATT GCCTGAAACG ATTGAAACTA AGGTCAAAGA ACAAGCTCAG 85 0 

CTGCTCTATC GTAGTCTTGG AC TTAAAGGT CTTGCTCGCA TCGACTTTTT 900 

TGTCACGGAT CAAGGAGAAC TATACTTGAA TGAAATCAAT ACTATGCCGG 9 50 

GCTTTACGAG TCACTCCCGC TATCCTGCCA TGATGGCAGC GGTCGGGTTA 1000 

TCCTATCAAG AACTACTACA AAAACT AC TT GT 1032 

30 



40 



2 ) INFORMATION FOR SEQ ID NO: 1065 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1034 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
45 (B) STRAIN: R758 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1065 

AAAAACATGA AAAAAATCGC CATTATTTTT GGAGGCAATT C AC CGG AAT A 5 0 

5 0 CACCGTTTCT TTAGCTTCAG CAACTAGCGC AATCGAAGCA CTCCAATCAT 100 

CTCCCTATGA CTACGACCTC TCTTTGATCG GGATCGCCCC AGATGCTATG 150 

GATTGGTACT TGTATACAGG AGAACTGGAA AACATCCGAC AAG AC AC GTG 200 

GTTGTTGGAT ACGAAACATA AACAGAAAAT ACAGCCGCTA TTCGAAGGAA 250 

ACGGCTTTTG GCTAAGTGAA GAGCAGCAAA CGTTGGTACC TGATGTTTTA 3 00 

55 TTTC CC ATT A TGCATGGCAA ATACGGGGAA GATGGCAGTA TCCAAGGATT 350 

GTTTGAATTG ATGAAGC TGC CTTATGTAGG CTGCGGGGTG GCAAGTTCTG 400 

CCTTATGTAT GAACAAATGG CTGCTGCATC AAGCTGCAGC AGCCATTGGC 450 

GTACAAAGTG CTCCTACGAT TC TCTTG AC A AATCAAGCCA ACCAGCAAGA 500 

ACAAATCGAA GCTTTTATCC AGACCCATGG CTTTCCAGTT TTCTTTAAGC 550 

60 CTAATGAAGC GGGCTCCTCA AAAGGGATCA C TAAAGTC AC CTGCGTTGAA 600 
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CA 02307010 2000-05-19 



GAAATCGCTT CTGCCTTAAA AGAAGCCTTT 
CCTACAAAAA AATATTGCCG GTGTTGAGAT 
ACGACTCTTT GACTGTCGGT GCTTGTGACG 
TTTTTCGATT TTGAAGAAAA GTACCAGCTG 
5 CCCTGCGCCA TTGCCTGAAA CGATTGAAAC 
AGCTGCTCTA TCGTAGTCTT GGTCTTAAAG 
TTTGTCACGG ATCAAGGAGA ACTATACTTG 
GGGCTTTACG AGTCACTCCC GCTATCCTGC 
TATCCTATCA AGAACTACTA CAAAAACTGC 



AC TT ATTGTT CCGCAGTGCT 650 

CGGTTGCGGT ATTTTGGGCA 7 00 

CCATTTCATT AGTAGACGGC 7 50 

ATCAGCGCCA AAATCACCGT 800 

CAAGGTCAAA GAACAAGCTC 850 

GTCTTGCTCG CATCGACTTT 900 

AATGAAATCA ATACTATGCC 950 

CATGATGGCA GCGGTCGGCT 1000 

TTGT 1034 



2 ) INFORMATION FOR SEQ ID NO: 1066 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1012 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
25 (B) STRAIN: R760 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1066 



CATGAAAAAA ATCGCCATTA TTTTTGGAGG CAATTCACCG GAATAC AC C G 50 

30 TTTCTTTAGC CTCAGCAACT AGCGCAATCG AAGCACTCCA ATCATCTCCC 100 

TATGATTACG ACCTCTCTTT GATCGGGATC GCCCCAGATG CTATGGATTG ' 150 

GTACTTGTAT ACAGGAGAAC TGGAAAACAT CCGACAAGAC ACGTGGTTGT 200 

TGGATACGAA ACATACACAG AAAATCCAGC CACTTTTTGA AGGAAACGGC 250 

TTTTGGATAA GTGAAGCGCA G C AAACGTTG GTACCTGATG TTTTATTTCC 3 00 

35 CATTATGCAT GGTAAATACG GGGAAGATGG CAGTATCCAA GGATTGTTTG 3 50 

AATTGATGAA GCTGCCTTAT GTAGGCTGTG GGGTGGCAGC CTCTGCCTTA 400 

TGTATGAACA AATGGTTATT GCATCAAGCA GCAGCAGCGA TTGGCGTACA 450 

AAGCGCTCCT ACGATTCTCT TGACAAATCA AGCCAACCAG CAAAGACAAA 500 

TCGAAGCCTT TATCC AG AC C CATGGCTTTC CAGTTTTCTT TAAGC CTAAT 550 

40 GAAGCGGGCT CCTCAAAAGG GATCACAAAA GTAACTTGTG TTGAAGAAAT 600 

CGCTCCTGCC TTGAAGGAAG CCTTCGCTTA TTGTTCCGCA GTGCTCTTAC 650 

AAAAAAATAT CGCTGGCGTT GAGATTGGTT GCGGTATCTT AGGCAACGAC 700 

TCTTTGACTG TCGGTGCTTG TGACGCTATT TCATTAGTAG ACGGC TTTTT 750 

CGATTTTGAA GAAAAGT AC C AGTTGATCAG CGCCAAAATC ACCGTTCCTG 800 

45 CGCCATTGCC TGAAAC G ATT GAAACCAAAG TCAAAGAACA AGCTCAGCTG 850 

C TCTATC AC A GTCTTGGTCT TAAAGGACTT GCTCGCATCG AC TTTTTTGT 900 

CACGGATCAA GGAGAACTAT ACTTGAATGA AATCAATACT ATGCCGGGCT 95 0 

TTACGAGTCA CTCCCGCTAT CCTGCCATGA TGGCAGCGGT CGGCTTATCC 1000 

TATCAAGAAT TA 1012 

50 



2 ) INFORMATION FOR SEQ ID NO: 1067 



55 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 721 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R481 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1067 



CTTACGCTTT ATCGATTAGA CACGGGNAGC TTGTCCAATG GGRAGCCGAT 5 0 

10 TTGATTTTAT GGATGAACGC TCTCATCATG CGGCAAATGG AATATCATGC 100 

AATGAAGCGC AAAATCGCAG ACGTTTGCGC TCCATCATGG AAAACAGTGG 150 

GTTTGAAGCA TATAGCCTCG AATGGTGGCA CTATGTATTA AGAGACGAAC 2 00 

CATACCCCAA TAGCTATTTT GATTTCCCCG TTAAATAAAC TTTTAAC CGT 250 

TGCACGGACA AACTATATAA GCTAACTCTT TCGGCAGGAA ACCCGACGTA 3 00 

15 TGTAACTGGT TCTTAGGGAA TTTATATATA GTAGATAGTA TTGAAGATGT 3 50 

AAGGCAGAGC GATATTGCGG TCATTATCTG CGTGCGCTGC GGCAAGATAG 400 

CCTGATAATA AG AC TGATCG CATAGAGGGG TGGTATTTCA CACCGCCCAT 450 

TGTCAACAGG CAGTTCAGCC TCGTTAAATT CAGCATGGGT ATCACTTATG 500 

AAAATTCATC TACATTGGTG ATAATAGTAA ATCCAGTAGG GCGAAATAAT 55 0 

2 0 TGACTGTAAT TTACGGGGCA AAACGGCACA ATCTCAAACG AGATTGTGCC 600 

GTTTAAGGGG AAG ATTC TAG AAATATTTCA TACTTCCAAC TATATAGTTA 650 

AGGAGGAGAC TGAAAATGAA GAAGTTGTTT TTTTTATTGT TATTGTTATT 7 00 

CTTAATATAC TTAGGTTATT G 721 

25 



2 ) INFORMATION FOR SEQ ID NO: 1068 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 668 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 



40 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R492 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1068 



ATTTTTAAGG 
ATGAAGCGCA 

45 TTTGAAGCAT 
ATACCCCAAT 
GCACGGACAA 
GTAACTGGTT 
AGGCAGAGCG 

50 CTGATAATAA 
GTCAACAGGC 
AAATTCATCT 
GACTGTAATT 
TTTAAGGGGA 

55 GGAGGAGACT 
TTAATATACT 



ATGAACGCTC 
AAATCGCAGA 
ATAGC CTCG A 
AGCTATTTTG 
ACTATATAAG 
CTTAGGGAAT 
ATATTGCGGT 
GACTGATCGC 
AGTTCAGCCT 
ACATTGGTGA 
TACGGGGCAA 
AGATTCTAGA 
GAAAATGAAG 
TAGGTTAT 



TTCATCATGC 
CGTTTGCGCT 
ATGGTGGCAC 
ATTTCCCCGT 
CTAACTCTTT 
TTATATATAG 
CATTATCTGC 
ATAGAGGGGT 
CGTTAAATTC 
TAATAGTAAA 
AACGGCACAA 
AATATTTCAT 
AAGTTGTTTT 



GGCAAATGGA 
CCATCATGGA 
TATGTATTAA 
TAAATAAACT 
C GGC AGG AAA 
TAGATAGTAT 
GTGCGCTGCG 
GGTATTTCAC 
AGCATGGGTA 
TCCAGTAGGG 
TCTCAAACGA 
ACTTCCAACT 
TTTTATTGTT 



ATATCATGCA 5 0 

AAACAGTGGG 100 

GAGACGAACC 150 

TTTAACCGTT 2 00 

CCCGACGTAT 2 50 

TGAAGATGTA 3 00 

GCAAGATAGC 3 50 

ACCGCCCATT 400 

TCACTTATGA 450 

CGAAATAATT 500 

GATTGTGCCG 5 50 

ATATAGTTAA 600 

ATTGTTATTC 650 
668 



60 2 ) INFORMATION FOR SEQ ID NO: 1069 

809 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 760 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R581 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1069 



CGGCAAGTGC CATTGATCTT ACGCTTTATC GATTAGACAC GGGTRAGCTT 50 

GTACCAATGG GAAGCCGATT TGATTTTATG GATGAACGCT CTCATCATGC 100 

GGCAAATGGA ATATCATGCA ATGAAGCGCA AAATCGCAGA CGTTTGCGCT 150 

CCATCATGGA AAACAGTGGG TTTGAAGCAT ATAGCCTCGA ATGGTGGCAC 2 00 

2 0 TATGTATTAA GAGACGAACC ATACCCCAAT AGCTATTTTG ATTTCCCCGT 2 50 

TAAATAAACT TTTAACCGTT GCACGGACAA ACTATATAAG CTAACTCTTT 3 00 

CGGCAGGAAA CCCGACGTAT GTAACTGGTT CTTAGGGAAT TTATATATAG 350 

TAGATAGTAT TGAAGATGTA AGGCAGAGCG ATATTGCGGT CATTATCTGC 4 00 

GTGCGCTGCG GCAAGATAGC CTGATAATAA GACTGATCGC ATAGAGGGGT 450 

25 GGTATTTCAC ACCGCCCATT GTCAACAGGC AGTTCAGCCT CGTTAAATTC 5 00 

AGCATGGGTA TCACTTATGA AAATTCATCT ACATTGGTGA TAATAGTAAA 550 

TCCAGTAGGG CGAAATAATT GACTGTAATT TACGGGGCAA AACGGCACAA 600 

TCTCAAACGA GATTGTGCCG TTTAAGGGGA AGATTCTAGA AATATTTCAT 650 

ACTTCCAACT ATATAGTTAA GGAGGAGACT GAAAATGAAG AAGTTGTTTT 7 00 

3 0 TTTTATTGTT ATTGTTATTC TTAATATACT TAGGTTATGA CTACGTTAAT 750 

GAAANCCTGA - 7 60 



35 2 ) INFORMATION FOR SEQ ID NO: 1070 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: R610 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1070 



AAAAGGAATA CGGGGC TTTC AAAAATCCAA GCCATAACCC GCGGGCAAGT 50 

GCCATTTGAT TCTTACGCTT TAATCGATTA GACACGGGTA AGCTTGTACC 100 

AATGGGGAAC CGATTTGATT TTAATGNATG AACGCTCTTC ATCATGCGGC 150 

AAATGGAATA TCATGCAATG AAGCGCAAAA TCGCAGACGT TTGCGCTCCA 2 00 

55 TCATGGAAAA CAGTGGGTTT GAAGCATATA GCCTCGAATG GTGG C AC TAT 25 0 

GTATTAAGAG ACGAACCATA CCCCAATAGC TATTTTGATT TCC CCGTTAA 300 

ATAAACTTTT AACCGTTGCA CGGACAAACT ATATAAGCTA ACTCTTTCGG 3 50 

CAGGAAACCC GACGTATGTA ACTGGTTCTT AGGGAATTTA TATATAGTAG 400 

ATAGTATTGA AGATGTAAGG CAGAGCGATA TTGCGGTCAT TATCTGCGTG 450 

60 CGCTGCGGCA AGATAGCCTG ATAATAAGAC TGATCGCATA GAGGGGTGGT 500 

810 
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10 



20 



ATTTCACACC GCCCATTGTC AACAGGCAGT TCAGCCTCGT TAAATTCAGC 550 

ATGGGTATCA CTTATGAAAA TTCATCTACA TTGGTGATAA TAGTAAATCC 600 

AGTAGGGCGA AATAATTGAC TGTAATTTAC GGGGCAAAAC GGCACAATCT 650 

C AAAC GAG AT TGTGCCGTTT AAGGGGAAGA TTCTAGAAAT ATTTCATACT 7 00 

TCCAACTATA TAGTTAAGGA GG AG AC TGAA AATGAAGAAG TTGTTTTTTT 7 50 

TTATTGTTAT TGTTATTCTT AATATACTTA GGTTATGACT ACGTTAATGA 800 

A 801 



2 ) INFORMATION FOR SEQ ID NO: 1071 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 711 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R684 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1071 

TTGTACCAAT GGGGAGCCGA TTTGATTTTA TGGATGAACG CTCTCATCAT 50 

GCGGCAAATG GAATATCATG CAATGAAGCG CAAAATCGCA GACGTTTGCG 100 

CTCCATCATG GAAAACAGTG GGTTTGAAGC ATATAGCCTC GAATGGTGGC 150 

3 0 AC T ATGTATT AAGAGACGAA CCATACCCCA ATAGC TATTT TGATTTCCCC 2 00 

GTTAAATAAA CTTTTAACCG TTGCACGGAC AAACTATATA AGCTAACTCT 2 50 

TTCGGCAGGA AACCCGACGT ATGTAAC TGG TTCTTAGGGA ATTTATATAT 3 00 

AGTAGATAGT ATTGAAGATG TAAGGCAGAG CGATATTGCG GTCATTATCT 350 

GCGTGCGCTG CGGCAAGATA GCCTGATAAT AAGACTGATC GCATAGAGGG 400 

3 5 GTGGTATTTC ACACCGCCCA TTGTCAACAG GCAGTTCAGC CTCGTTAAAT 450 

TCAGCATGGG TATCACTTAT GAAAATTCAT CTACATTGGT GATAATAGTA 500 

AATCCAGTAG GGCGAAATAA TTGACTGTAA TTTACGGGGC AAAACGGCAC 550 

AATCTCAAAC GAGATTGTGC CGTTTAAGGG GAAGATTCTA GAAATATTTC 600 

ATACTTCCAA CTATATAGTT AAGGAGGAGA CTGAAAATGA AGAAGTTGTT 650 

4 0 TTTTTTATTG TTATTGTTAT TCTTAATATA CTTAGGTTAT GACTACGTTA 700 

ATGAAGCACT G 711 



45 2 ) INFORMATION FOR SEQ ID NO: 1072 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R688 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1072 
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GCCATTGATC TTACGCTTTA TCGATTAGAC ACGGGTNAGC TTGTACCAAT 50 

GGGGAGCCGA TTTGATTTTA TGGATGAACG CTCTCATCAT GCGGCAAATG 100 

GAATATCATG CAATGAAGCG CAAAATCGCA GACGTTTGCG CTCCATCATG 150 

GAAAACAGTG GGTTTGAAGC ATATAGCCTC GAATGGTGGC AC TATGTATT 200 

5 AAGAGACGAA CCATACCCCA ATAGCTATTT TGATTTCCCC GTTAAATAAA 2 50 

CTTTTAACCG TTGCACGGAC AAACTATATA AGCTAACTCT TTCGGCAGGA 3 00 

AACCCGACGT ATGTAACTGG TTC TTAGGGA ATTTATATAT AGTAGATAGT 350 

ATTGAAGATG TAAGGCAGAG CGATATTGCG GTCATTATCT GCGTGCGCTG 400 

CGGCAAGATA GCCTGATAAT AAGACTGATC GCATAGAGGG GTGGTATTTC 450 

10 ACACCGCCCA TTGTCAACAG GCAGTTCAGC CTCGTTAAAT TCAGCATGGG 500 

TATCACTTAT GAAAATTCAT CTACATTGGT GATAATAGTA AATCCAGTAG 550 

GGCGAAATAA TTGACTGTAA TTTACGGGGC AAAACGGCAC AATCTCAAAC 600 

GAGATTGTGC CGTTTAAGGG GAAGATTCTA GAAATATTTC ATACTTCCAA 650 

CTATATAGTT AAGGAGGAGA CTGAAAATGA AGAAGTTGTT TTTTTTATTG 7 00 

15 TTATTGTTAT TCTTAATATA CTTAGGTTAT GACTACGTTA ATGAAGCACT 75 0 

G 751 



20 2 ) INFORMATION FOR SEQ ID NO: 1073 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 685 bases 

(B) TYPE: Nucleic acid 

2 5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterdcoccus flavescens 

(B) STRAIN: R689 



35 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1073 



ATTTGATTTT ATGGATGAAC GCTCTCATCA TGCGGCAAAT GGAATATCAT 50 

GCAATGAAGC GCAAAATCGC AGACGTTTGC GCTCCATCAT GGAAAACAGT 100 

GGGTTTGAAG CATATAGCCT CGAATGGTGG CACTATGTAT T AAG AG AC G A 150 

ACCATACCCC AATAGCTATT TTGATTTCCC CGTTAAATAA AC TTTTAAC C 2 00 

40 GTTGCACGGA CAAACTATAT AAGC TAACTC TTTCGGCAGG AAACCCGACG 250 

TATGTAACTG GTTCTTAGGG AATTTATATA TAGTAGATAG TATTGAAGAT 3 00 

GTAAGGCAGA GCGATATTGC GGTCATTATC TGCGTGC GCT GCGGCAAGAT 35 0 

AGCCTGATAA TAAGACTGAT CGCATAGAGG GGTGGTATTT CACACCGCCC 400 

ATTGTCAACA GGCAGTTCAG CCTCGTTAAA TTCAGCATGG GTATCACTTA 45 0 

45 TGAAAATTCA TCTACATTGG TGATAATAGT AAATCCAGTA GGGCGAAATA 500 

ATTGACTGTA ATTTACGGGG CAAAACGGCA CAATCTCAAA CGAGATTGTG 550 

CCGTTTAAGG GGAAGATTCT AGAAATATTT CATACTTCCA AC TAT AT AGT 600 

TAAGGAGGAG AC TG AAAATG AAGAAGTTGT TTTTTTTATT GTTATTGTTA 650 

TTCTTAATAT ACTTAGGTTA TGACTACGTT AATGA 685 

50 

2) INFORMATION FOR SEQ ID NO: 1074 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 732 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDftESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
5 (B) STRAIN: R690 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1074 

ATCGATTAGA CACGGGTGAG CTTGTACCAA TGGGGAGCCG ATTTGATTTT 50 

10 ATGGATGAAC GCTCTCATCA TGCGGCAAAT GGAATATCAT GCAATGAAGC 100 

GCAAAATCGC AGACGTTTGC GCTCCATCAT GGAAAACAGT GGGTTTGAAG 150 

CATATAGCCT CGAATGGTGG CACTATGTAT TAAGAGACGA ACCATACCCC 200 

AATAGC TATT TTGATTTCCC CGTTAAATAA ACTTTTAACC GTTGCACGGA 250 

CAAACTATAT AAGC TAACTC TTTCGGCAGG AAACCCGACG TATGTAACTG 300 

15 GTTCTTAGGG AATTTATATA TAGTAGATAG TATTGAAGAT GTAAGGCAGA 350 

GCGATATTGC GGTCATTATC TGCGTGCGCT GCGGCAAGAT AGCC TGATAA 400 

TAAG AC TGAT CGCATAGAGG GGTGGTATTT CACACCGCCC ATTGTCAACA 45 0 

GGCAGTTCAG CCTCGTTAAA TTCAGCATGG GTATCACTTA TGAAAATTCA 500 

TCTACATTGG TGATAATAGT AAATC CAGTA GGGCGAAATA ATTGACTGTA 550 

20 ATTTACGGGG CAAAACGGCA CAATC TC AAA CGAGATTGTG CCGTTTAAGG 600 

GGAAGATTCT AGAAATATTT CATACTTCCA ACTATATAGT TAAGGAGGAG 650 

ACTGAAAATG AAGAAGTTGT TTTTTTTATT GTTATTGTTA TTCTTAATAT 700 

ACTTAGGTTA TG AC TACGTT AATGAAGCAC TG 732 



25 



40 



2 ) INFORMATION FOR SEQ ID NO: 1075 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 67 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R691 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1075 



TCTCATCATG CGGCAAATGG AATATCATGC AATGAAGCGC AAAATCGCAG 50 

ACGTTTGCGC TCCATCATGG AAAACAGTGG GTTTGAAGCA TATAGCCTCG 100 

45 AATGGTGGCA CTATGTATTA AGAGACGAAC CATACCCCAA TAGCTATTTT 150 

GATTTCCCCG TTAAATAAAC TTTTAACCGT TGCACGGACA AACTATATAA 200 

GCTAACTCTT TCGGCAGGAA ACCCGACGTA TGTAACTGGT TCTTAGGGAA 25 0 

TTTATATATA GTAGATAGTA TTGAAGATGT AAGGCAGAGC GATATTGCGG 300 

TCATTATCTG CGTGCGCTGC GGCAAGATAG CCTGATAATA AGACTGATCG 350 

50 CATAGAGGGG TGGTATTTCA CACCGCCCAT TGTCAACAGG CAGTTCAGCC 400 

TCGTTAAATT CAGC ATGGGT ATC ACTTATG AAAATTCATC TACATTGGTG 45 0 

ATAATAGTAA ATCCAGTAGG GCGAAATAAT TG AC TGTAAT TTACGGGGCA 500 

AAACGGCACA ATCTCAAACG AGATTGTGCC GTTTAAGGGG AAGATTC TAG 550 

AAATATTTCA TACTTCCAAC TATATAGTTA AGGAGGAGAC TGAAAATGAA 600 

55 GAAGTTGTTT TTTTTATTGT TATTGTTATT CTTAATATAC TTAGGTTATG 65 0 

ACTACGTTAA TGAAGCACTG 670 



60 2 ) INFORMATION FOR SEQ ID NO: 1076 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1246 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: DG131/3 serotype OX3 :H8 

(C) ACCESSION NUMBER: extracted from Z36901 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107 6 

TTGTGTAATA TGAAAATAAT AATTTTTAGA GTGC TAACTT TTTTCTTTGT 50 

TATC TTTTCT GTTAATGTGG TTGCGAAGGA ATTTACCTTA GATTTCTCGA 100 

CAGCAAAGAC GTATGTAGAT TCGCTGAATG TCATTCGCTC TGCAATAGGT 150 

20 ACTCCATTAC AGACTATTTC ATCAGGAGGT ACGTCTTTAC TGATGATTGA 200 

TAGTGGCACA GGGGATAATT TGTTTGCAGT TGATGTCAGA GGGATAGATC 250 

CAGAGGAAGG GC GGTTTAAT AATCTACGGC TTATTGTTGA ACGAAATAAT 3 00 

TTATATGTGA CAGGATTTGT TAACAGGACA AATAATGTTT TTTATCGCTT 350 

TGCTGATTTT TCACATGTTA CCTTTCCTGG TACAACTGCG GTTACATTGT 400 

25 CTGGTGACAG TAGCTATACC ACGTTACAGC GTGTTGCGGG GATCAGTCGT 450 

ACGGGGATGC AGATAAATCG CCATTCGTTG ACTACTTCTT ATCTGGATTT 500 

AATGTCGCAT AGCGGAACCT CACTGACGCA GTCTGTGGCA AGAGCGATGT 550 

TACGGTTTGT TACTGTGACA GCTGAAGCTT T AC GTTTTCG GCAAATTCAG 600 

AGGGGATTTC GTACAACACT TGATGATCTC AGTGGGCGTT CTTATGTAAT 650 

30 GACTGCTGAA GATGTTGATC TTACGTTGAA CTGGGGAAGG TTGAGTAGTG 7 00 

TCCTGCCTGA CTATCATGGA CAAGACTCTG TTCGTGTTGG AAGAATTTCT 750 

TTTGGAAGTG TTAATGCAAT TCTGGGTAGC GTGGCATTAA TACTGAATTG 800 

TCATCATCAT GCATCGCGAG TTGCCAGAAT TGTACCTAAT GAGTTTCCTT 85 0 

CTATGTGCCC GGTAGATGGA AGAGTGCGTG GGATTACGCA CAATAAAATA 900 

35 TTGTGGGACT CATCCACTCT GGGGGCAATT TTGATACGCA GGGCTATTAG 950 

CAGTTGAGGG GGGTAAAATG AAAAAAATAT TATTAATAGC TGCATCACTT 1000 

TCATTTTTTT CAGCAAGTGT GCTGGCTGCG C C AG ATTGTG TAACTGGGAA 1050 

GGTGGAGTAT ACAAAATATA ATGATGACGA TACCTTTACA GTTAAAGTGG 1100 

GAGATAAAGA ATTATTTACT AACAGATGGA ATCTTCAGTC TCTTCTTCTC 1150 

40 AGTGCACAAA TTACGGGGAT GACGGTAACC ATTAAAACTA ATGCCTGTCA 12 00 

TAATGGAGGG GGATTCAGCG AGGTTATTTT CCGTTGACTC AGAATA 1246 



45 2 ) INFORMATION FOR SEQ ID NO: 1077 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1259 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: 94C serotype 048:H21 

(C) ACCESSION NUMBER: extracted from Z37725 



60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1077 
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CACCTGTATA TGAAGTGTAT ATTATTTAAA TGGGTACTGT GCCTGTTACT 50 
GGGTTTTTCT TCGGTATCCT ATTCCCGGGA GTTTACGATA GAC TTTTCG A 100 
CCCAACAAAG TTATGTCTCT TCGTTAAATA GTATACGGAC AGAGATATCG 150 
5 ACCCCTCTTG AACATATATC TCAGGGGACC ACATCGGTGT CTGTTATTAA 2 00 

CCACACCCCA CCGGGCAGTT ATTTTGCTGT GGATATACGA GGGCTTGATG 250 
TCTATCAGGC GCGTTTTGAC CATCTTCGGC TGATTATTGA GCAAAATAAT 300 
TTATATGTGG CCGGGTTCGT TAATACGGCA AC AAATAC TT TCTACCGTTT 350 

TTCAGATTTT ACACATATAT CAGTGCCCGA TGTGACAACG GTTTCCATGA 400 

10 CAACGGACAG CAGTTATACC ACTC TG C AAC GTGTCGCAGC GCTGGAACGT 450 

TCCGGAATGC AAATCAGTCG TCACTCACTG GTTTCATCAT ATCTGGCGTT 500 

AATGGAGTTC AGTGGTAATA CAATGACCAG AGATGCATCC AGAGCAGTTC 550 

TGCGTTTTGT CACTGTCACA GCAGAAGCCT TACGCTTCAG GCAGATACAG 600 

AGAGAATTTC GTCAGGCACT GTCTGAAACT GCTCCTGTGT AT AC C ATG AC 650 

15 GCCGGGAGAC GTGGACCTCA CTCTGAACTG GGGGCGAATC AGCAATGTGC 7 00 

TTCCGGAGTA TCGGGGAGAG GATGGTGTCA GAGTGGGGAG AATATCCTTT 7 50 

AATAATATAT CAGCGATACT GGGTACTGTG GCCGTTATAC TGAATTGCCA 8 00 

TCATCAGGGG GCGCGTTCTG TTCGCGCCGT GAATGAAGAG AGTCAACCAG 8 50 

AATGTCAGAT AACTGGCGAC AGGCCCGTTA TAAAAATAAA CAATACATTA 900 

2 0 TGGGAAAGTA ATACAGCTGC AGCGTTTCTG AACAGAAAGT CACAGTTTTT 950 

ATATACAACG GGTAAATAAA GGAGTTAAGT ATGAAGAAGA TGTTTATGGC 1000 

GGTTTTATTT GCATTAGTTT CTGTTAATGC AATGGCGGCG GATTGTGCTA 1050 

AAGGTAAAAT TGAGTTTTCC AAGTATAATG AGGATGACAC ATTTACAGTG 1100 

AAGGTTGACG GGAAAGAATA CTGGACCAGT CGCTGGAATC TGCAACCGTT 1150 

25 ACTGCAAAGT GCTCAGCTGA CAGGAATGAC TGTCACAATC AAATCCAGTA 12 00 

CCTGTGAATC AGGCTCCGGA TTTGC TGAAG TGCAGTTTAA TAATGACTGA 12 50 

GGCATAACC 12 59 



30 



40 



45 



2 ) INFORMATION FOR SEQ ID NO: 1078 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1078 
AGTTCTGCGT TTTGTCACTG TC 22 

2 ) INFORMATION FOR SEQ ID NO : 1079 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107 9 



60 



CGGAAGCACA TTGCTGATT ig 
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10 



20 



2 ) INFORMATION FOR SEQ ID NO : 1080 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

( D ) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1080 
15 TATAGCTACT GTCACCAGAC AATGT 25 



2 ) INFORMATION FOR SEQ ID NO: 1081 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1081 

30 

ATGTCAGAGG GATAGATCCA 2 0 



35 2 ) INFORMATION FOR SEQ ID NO: 1082 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1082 

TTGARCRAAA TAATTTATAT GTG 23 



50 

2 ) INFORMATION FOR SEQ ID NO: 1083 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1083 
TGATGATGRC AATTCAGTAT 2 0 

5 

2 ) INFORMATION FOR SEQ ID NO: 1084 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 39 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1084 
CCACGCCGCT TTGCTGATTT TTCACATGTT ACCGCGTGG 3 9 

20 

2 ) INFORMATION FOR SEQ ID NO: 1085 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1085 
35 CCACGCCACT GTCTGAAACT GCTCCTGTGC GTGG 34 



40 



2 ) INFORMATION FOR SEQ ID NO: 1086 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1086 

50 

CTACTCCCGC CTTTTGGGTT 20 



55 2 ) INFORMATION FOR SEQ ID NO: 1087 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1087 

CTCACAGCCC GAAACAGCCT 20 



10 



20 



25 



60 



2 ) INFORMATION FOR SEQ ID NO: 1088 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1088 
TGCCGTTTCC TGTATC CGTC 2 0 

2 ) INFORMATION FOR SEQ ID NO: 1089 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 20 bases 

(B) TYPE:--Nucleic-acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1089 
ATCCACACGG GCTAG AC C TC 20 

40 

2 ) INFORMATION FOR SEQ ID NO: 1090 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

( D ) TOPOLOGY : Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1090 
5 5 AATAGCGCGG ACGAATTGGA C 21 



2 ) INFORMATION FOR SEQ ID NO: 1091 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1091 
AACGCGGCAC TGTTTCCCAA 



2) INFORMATION FOR SEQ ID NO: 1092 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1092 
TCGGCAAGAC AATATGACAG C 



2 ) INFORMATION FOR SEQ ID NO: 1093 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 bases 

(B) TYPE: Nucleic acid 

(C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophytxcus 

(B) STRAIN: CSsa-165 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1093 

TAACGGGCGT CTCGATAGAA AAACACGTGA AAATCCCAAT GATTATAAAC 
AATCAATATA CGATTTTGCT GAAGCTGTAA CAAAAGGTAT TAAGGAACAA 
ACAAATAAAA ATTAATAGGC AACTTAACCA GAATCGTTAA AACTATATGA 
CGATTCTGGT TTTTTAAATT CAAAAAGTTT TCTAAAAAAT TTACCTGCTT 
TTTTAAAGTA TAGGTATAAA ATACAATTGA TTAAAATAGT AAAGGAAATG 
AATCATGAAA CAATTAACTA AGCCTTTATA CTTTTACCTA TTACTTTTTA 
TTACAACAAC ACTGATTGGC GCGTTAC TAT TATATTTGCC AATCACAGGT 
AAACATCCTA TTGATTTTGT GGACGCCCGT T 



2 ) INFORMATION FOR SEQ ID NO: 1094 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1094 
10 GGTAAAACAG GTACTTCTAA CTA 23 



15 



2 ) INFORMATION FOR SEQ ID NO : 1095 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1095 

25 

CGATAGAAGC AGCAGGACAA 20 



3 0 2 ) INFORMATION FOR SEQ ID NO: 1096 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



40 



45 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1096 
CTGATGGATG CGGAAGATAC 20 

2 ) INFORMATION FOR SEQ ID NO: 1097 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1097 
TCYTCAAAAG GGATCACWAA AGTMAC 26 

60 
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10 



2 ) INFORMATION FOR SEQ ID NO: 1098 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNE S S : S ing 1 e 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1098 
15 TCTTCAAAAT CGAAAAAGCC GTC 2 3 

2 ) INFORMATION FOR SEQ ID NO: 1099 

2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

25 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1099 

3 0 TCAAAAGGGA TCACWAAAGT MAC 2 3 



35 



2 ) INFORMATION FOR SEQ ID NO: 1100 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1100 

45 

GTAAAKCCCG GCATRGTRTTGATTTC 26 



50 2 ) INFORMATION FOR SEQ ID NO: 1101 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1101 
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GACGGYTTTT TYGATTTTGAAGA 2 3 



2 ) INFORMATION FOR SEQ ID NO: 1102 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



15 



20 



55 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1102 
AAAAARTCGA TKCGAGCMAGACC 23 

2 ) INFORMATION FOR SEQ ID NO: 1103 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1103 
ATCCCGCTAT GAAAACGATC 20 

35 

2 ) INFORMATION FOR SEQ ID NO: 1104 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1104 
50 GGATCAACAC AGTAGAACCG 20 



2 ) INFORMATION FOR SEQ ID NO: 1105 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
60 (D) TOPOLOGY: Linear 
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5 



25 



35 



40 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1105 
CTCCTACGAT TCTCTTGAYA AATCA 25 



10 2 ) INFORMATION FOR SEQ ID NO: 1106 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1106 

CAACCGATCT CAACACCGGC AAT 23 



2 ) INFORMATION FOR SEQ ID NO: 1107 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
30 (B) TYPE: Nucleic acid 

(C ) STRANDEDNESS : Single 

(D) * TOPOLOGY : Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1107 
CTCATTTGAC TTCCTCCTTT GCT 23 

2 ) INFORMATION FOR SEQ ID NO : 1108 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1108 
GTAAGAATCG GAAAAGCGGA AGG 23 

55 

2 ) INFORMATION FOR SEQ ID NO: 1109 

60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1109 
AC ATC GTG AT CGCTAAAAGG AGC 



2 ) INFORMATION FOR SEQ ID NO: 1110 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

( D ) TOPOLOGY : L inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1110 
ACGAGAAAGA CAACAGGAAG ACC 



2 ) INFORMATION FOR SEQ ID NO: 1111 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : S ing 1 e 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1111 
CTTTTTCCGG CTCGWYTTCC TGATG 



2 ) INFORMATION FOR SEQ ID NO: 1112 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1112 
GGCTGYGATA TTCAAAGCTC 
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2 ) INFORMATION FOR SEQ ID NO: 1113 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1113 
ACCGACCTCA CAGCCCGAAA 2 0 

15 

2 ) INFORMATION FOR SEQ ID NO: 1114 

2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1114 

3 0 TCWGAGCCTT TTTCCGGCTC G 21 



35 



2 ) INFORMATION FOR SEQ ID NO: 1115 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1115 

45 

TTTCGGGCTG TGAGGTCGGB TGHGCG 2 6 



50 2 ) INFORMATION FOR SEQ ID NO : 1116 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1116 
TTTCGGGCTG TGAGGTCGGB TGHGCGG 

5 

2 ) INFORMATION FOR SEQ ID NO: 1117 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 801 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: 94 

2 0 (C) ACCESSION NUMBER: U9452 6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1117 

AAATTCGATC CGCACTACAT CGGAATTACA AAAAAC GGTG TATGGAAGCT 50 

2 5 ATGCAAGAAG CCATGTACGG AATGGGAAGC CGACAGTCTC CCCGCCATAC 100 

TCTCCCCGGA TAGGAAAACG CATGGGCTGC TTGTCATGAA AGAAAGCGAA 150 

TACGAAACAC GGCGTATTGA TGTGGCTTTC C CGGTTTTGC ATGGCAAATG 200 

CGGGGAGGAT GGTGCGATAC AGGGGCTGTT TGTATTGTCT GGTATCCCCT 2 50 

ATGTGGGCTG TGATATTCAA AGCTCCGCAG CTTGCATGGA CAAATCACTG 300 

3 0 GCCTACATTC TTACAAAAAA TGCGGGCATC GCCGTTCCCG AATTTCAAAT 350 

GATTGATAAA GGTGACAAGC CGGAGGCGGG TGCGCTTACC TACCCTGTCT 4 00 

TTGTG AAGC C GGCACGGTCA GGTTCGTCCT TTGGCGTAAC CAAAGTAAAC 450 

GGT AC GGAAG AACTTAACGC TGCGATAGAA GCGGCAGGAC AATATGATGG 50 0 

AAAAATCTTA ATTGAGCAAG CGATTTCGGG CTGTGAGGTC GGGTGTGCGG 5 50 

3 5 TCATGGGGAA CGAGGATGAT TTGATTGTCG GCGAAGTGGA TCAAATCCGG 600 

CTGAGCCACG GTATCTTCCG CATCCATCAG GAAAACGAGC CGGAAAAAGG 65 0 

CTCAGAAAAT GCGATGATTA CAGTTCCCGC AGACATTCCG GTCGAGGAAC 7 00 

GAAATCGGGT GCAGGAAACG GCAAAGAAAG TATATCGGGT GCTTGGATGC 750 

AGAGGGCTTG CCCGTGTTGA TCTTTTTTTG CAGGAGGATG GCGGCATCGT 800 

40 T 801 



27 



45 



2 ) INFORMATION FOR SEQ ID NO: 1118 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1118 
TTTTCWGAGC CTTTTTCCGG CTCG z * 



60 2 ) INFORMATION FOR SEQ ID NO: 1119 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1119 

TTTCGGGCTG TGAGGTCGGB TGHGC 25 



15 



25 



30 



2 ) INFORMATION FOR SEQ ID NO: 112 0 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
20 <B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

( D ) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112 0 
TTTCGGGCTG TGAGGTCGGB TGHG 24 



2 ) INFORMATION FOR SEQ ID NO: 1121 

(i) SEQUENCE CHARACTERISTICS: 
3 5 (A) LENGTH: 22 bases 

( B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1121 
TGTTTGWATT GTCYGGYATC CC 22 

45 



2 ) INFORMATION FOR SEQ ID NO: 1122 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1122 
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18 

TGGTGCATTG CTACGTGG 



5 2 ) INFORMATION FOR SEQ ID NO: 1123 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1123 

TTTCGGGCTG TGAGGTCGGB TG 22 

20 

2 ) INFORMATION FOR SEQ ID NO: 1124 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
25 <B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : S ing le 

( D ) TOPOLOGY : Linear 



30 



35 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1124 
GATTTGRTCC ACYTCGCCRA CA 22 



2 ) INFORMATION FOR SEQ ID NO: 1125 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 112 5 
ACTCACAACT GGGATGGATG 20 

50 

2 ) INFORMATION FOR SEQ ID NO: 1126 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112 6 

5 

TTATGGTTGT GCTGGTTGAG G 21 

10 2 ) INFORMATION FOR SEQ ID NO: 1127 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 2 0 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1127 

KCAAAYGCCA TTTCAAGTAA 20 



25 



35 



40 



2 ) INFORMATION FOR SEQ ID NO: 1128 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 bases 
30 , (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1128 
GACGACYTTA TKGATATACA 20 

2 ) INFORMATION FOR SEQ ID NO: 1129 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1129 
ATGATGACCG AMATGATGAA AAC 23 

55 

2 ) INFORMATION FOR SEQ ID NO: 113 0 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1130 

.0 

GACTATCCAA GCATGCATTA TG 



15 2 ) INFORMATION FOR SEQ ID NO: 1131 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1131 

CATCTGGAGC TACRTARCCA GT 



2 ) INFORMATION FOR SEQ ID NO: 1132 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 20 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

( D ) TOPOLOGY : L inear 

(ii) MOLECULE TYPE: DNA 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113 2 
AGTGAAAARA TGGCTGCTGC 



2 ) INFORMATION FOR SEQ ID NO: 113 3 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1133 
CATCAAGAAC ACTGGCTAYG TAG 
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2 ) INFORMATION FOR SEQ ID NO : 113 4 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1134 

15 

CTAGATAGAG CTAAAACCTT CCT 23 



20 2 ) INFORMATION FOR SEQ ID NO: 1135 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1135 

CATTATGCAA AC GCC ATTTC AAG 23 



35 



45 



50 



2 ) INFORMATION FOR SEQ ID NO: 113 6 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1136 
ACTTGTCCAC GTTSGATRTC T 21 

2) INFORMATION FOR SEQ ID NO: 113 7 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1137 
AATTAATGGC TGCWGTTGAY GAA 



23 



10 



15 



20 



2 ) INFORMATION FOR SEQ ID NO: 1138 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1347 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 
(C) ACCESSION NUMBER: M75132 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1138 



GAGCTCTTCC 

2 5 CACTAACAAT 

TTAGATTTTC 
TTTCTTTTCT 
AAAGGAAGAA 
CCAGAATACT 

3 0 TGACCCGCTG 

ATTGGTATTG 
CTAGAAGATC 
TATATTAGGA 
ATGGGAAGTA 

3 5 AACCTGCCTT 
CAAATGGCTC 
CCACTTTGCT 
TTTATTCAAG 
TTCTTCAAAA 

40 CATTAACGAC 
AT AGC GGGTA 
GATTGGTGCT 
AAGAGAAATA 
CCTCTCGCGC 

45 AAACTTGGGA 
AAGGAGCGAT 
CACTCCCGCT 
ATTAGTAGAG 
ATTACAATTG 

50 CGC AC TTAGT 



TTCAACGCAC 

AGCTTCCCCT 

ATCATAAAAA 

TTTCCTAGGT 

AATGATGAAA 

CAGTGTCACT 

AAATATGAAG 

GTATCAAGGA 

ACAAAAACTG 

GAAAAACGAA 

TGGCGAGGAT 

ATGTTGGTTG 

TTGCATCAAC 

TTTATCCCGC 

ACCATGGATT 

GGGATCACAA 

TGCTTTTGCT 

TTGAAATTGG 

TGTGATGCGA 

CCAATTAATC 

TTGAATCACA 

TTG AC GGGTC 

TTATTTAAAC 

ACCCAGCTAT 

CAATTGATTG 

ATCAATAAAA 

GCTAGCTCCT 



TTCTGTACCA 

GCTTTCTTCA 

TACGAGAAAG 

AC AC TG AATG 

AAAATTGCCG 

AACCTCAGCA 

TAATGAC C AT 

AACCTCGCGA 

TCACCAGCTG 

TCGTCCCTGA 

GGCTGTATCC 

CCATGTCGCT 

TTGCTGATAC 

TATGAAAACG 

CCCGATCTTT 

AAGTAACTGA 

TACGGTTCTA 

CTGCGGCATC 

TTTCTCTTGT 

AGCGCCACGA 

GATCAAGGAG 

TGGCTCGAAT 

GAAATCAACA 

GATGGCGGAA 

CACTGGCAGA 

ACCATCCATT 

TTTAGCGATC 



AGAGTTGTTG 

AGCCCTTTGT 

ACAACAGGAA 

T AAC C TT AAA 

TTTTATTTGG 

GCAAGTGTGA 

TGGCATCGCA 

ATGTTCGCAA 

AC TTTTTC TA 

TGTCCTCTTT 

AAGGACTGCT 

GCCTCCGCAT 

CATGGGAATC 

ATCCTGCCAC 

ATCAAGCCGA 

CAAAACAGCG 

C TGTGTTG AT 

TTAGGAAATG 

CGACGGTTTT 

TCACTGTCCC 

CAGGCACAGC 

CGATTTTTTC 

CCATGCCGGG 

GTCGGGTTAT 

GGAGGACAAA 

GAAAAAAAAT 

ACGATGTTTA 



TCCATTTGAT 50 

CATAAAATCG 100 

GACCGCAAAT 15 0 

AGAAAAAAGG 2 00 

AGGGAATTCT 2 50 

TCCAAGCTAT 3 00 

CCAACAATGG 3 50 

TGATACTTGG 400 

GCCAAGGATT 450 

CCAGTCTTGC 5 00 

TG AAC T AATG 550 

TATGTATGAA 600 

GCTAGTGCTC 650 

AATCGATCGT 7 00 

ATGAAGCCGG 7 50 
CTCCAATCTG 800 
CCAAAAGGCG 850 
AGCAATTGAC 900 
TTTGATTTTG 950 

AGCACCATTG 1000 

TGCTTTATCG 1050 

GTCACCAATC 1100 

ATTTAC TGGG 1150 

CCTACGAAAT 12 00 

CGATGAACAC 12 50 

CAAGAGCCCC 1300 

CCTGCAG 1347 



2 ) INFORMATION FOR SEQ ID NO: 113 9 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1768 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 
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10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: BM4147 

(C) ACCESSION NUMBER: X56895 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1139 



GATATCGTTA CGCTTCATGT GCCGCTCAAT ACGGATACGC ACTATATTAT 50 

CAGCCACGAA CAAATACAGA GAATGAAGCA AGGAGCATTT CTTATCAATA 100 

CTGGGCGCGG TCCACTTGTA GATACCTATG AGTTGGTTAA AGCATTAGAA 15 0 

AACGGGAAAC TGGGCGGTGC CGCATTGGAT GTATTGGAAG GAGAGGAAGA 2 00 

15 GTTTTTCTAC TCTGATTGCA CCCAAAAACC AATTGATAAT CAATTTTTAC 2 50 

TTAAACTTCA AAGAATGCCT AACGTGATAA TCACACCGCA TACGGCCTAT 3 00 

TATACCGAGC AAGCGTTGCG TGATACCGTT GAAAAAACCA TTAAAAACTG 3 50 

TTTGGATTTT GAAAGGAGAC AGGAGCATGA ATAGAATAAA AGTTGCAATA 400 

CTGTTTGGGG GTTGCTCAGA GGAGCATGAC GTATCGGTAA AATCTGCAAT 450 

2 0 AG AG AT AGC C GCTAACATTA ATAAAGAAAA ATACGAGCCG TTATACATTG 500 

GAATTACGAA ATCTGGTGTA TGGAAAATGT GCGAAAAACC TTGCGCGGAA 550 

TGGGAAAACG ACAATTGCTA TTCAGCTGTA CTCTCGCCGG ATAAAAAAAT 600 

GCACGGATTA CTTGTTAAAA AGAACCATGA ATATGAAATC AACCATGTTG 650 

ATGTAGCATT TTCAGCTTTG CATGGCAAGT CAGGTGAAGA TGGATCCATA 700 

25 CAAGGTCTGT TTGAATTGTC CGGTATCCCT TTTGTAGGCT GCGATATTCA 750 

AAGCTCAGCA ATTTGTATGG ACAAATCGTT GACATACATC GTTGCGAAAA 800 

ATGCTGGGAT AGCTACTCCC GCCTTTTGGG TTATTAATAA AGATGATAGG 850 

CCGGTGGCAG C TACGTTT AC CTATCCTGTT TTTGTTAAGC CGGCGCGTTC 900 

AGGCTCATCC TTCGGTGTGA AAAAAGTCAA TAGCGCGGAC GAATTGGACT 950 

3 0 ACGCAATTGA ATCGGCAAGA CAATATGACA GCAAAATCTT AATTGAGCAG 1000 

GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG GTATTGGGAA ACAGTGCCGC 1050 

GTTAGTTGTT GGCGAGGTGG ACCAAATCAG GCTGCAGTAC GGAATCTTTC 1100 

GTATTCATCA GGAAGTCGAG CCGGAAAAAG GCTCTGAAAA CGCAGTTATA 1150 

ACCGTTCCCG CAGACCTTTC AGCAGAGGAG CGAGGACGGA TACAGGAAAC 12 00 

3 5 GGCAAAAAAA ATATATAAAG CGCTCGGCTG TAGAGGTCTA GCCCGTGTGG 1250 

ATATGTTTTT ACAAGATAAC GGCCGCATTG TACTGAACGA AGTCAATACT 13 00 

CTGCCCGGTT TCACGTCATA CAGTCGTTAT CCCCGTATGA TGGCCGCTGC 1350 

AGGTATTGCA CTTCCCGAAC TGATTGACCG CTTGATCGTA TTAGCGTTAA 1400 

AGGGGTGATA AGCATGGAAA TAGGATTTAC TTTTTTAGAT GAAATAGTAC 1450 

40 ACGGTGTTCG TTGGGACGCT AAATATGCCA CTTGGGATAA TTTCACCGGA 1500 

AAAC CGGTTG ACGGTTATGA AGTAAATCGC ATTGTAGGGA CATACGAGTT 1550 

GGCTGAATCG CTTTTGAAGG CAAAAGAACT GGCTGCTACC CAAGGGTACG 1600 

GATTGCTTCT ATGGGACGGT TACCGTCCTA AGCGTGCTGT AAAC TGTTTT 1650 

ATGCAATGGG CTGCACAGCC GGAAAATAAC CTGACAAAGG AAAGTTATTA 17 00 

4 5 TCCCAATATT GACCGAACTG AGATGATTTC AAAAGGATAC GTGGCTTCAA 1750 

AATCAAGCCA TAGCCGCG 17 68 



50 2 ) INFORMATION FOR SEQ ID NO: 1140 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1086 bases 

(B) TYPE : Nucleic acid 
55 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Enterococcus casseliflavus 
(C) ACCESSION NUMBER: L29638 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1140 

5 GTAAGAATCG GAAAAGCGGA AGGAAGAAAA ACATGAAAAA AATCGCCATT 50 

ATTTTTGGAG GCAATTCACC GGAATACACC GTTTC TTT AG CTTCAGCAAC 100 

TAGCGCAATC GAAGCACTCC AATCATCTCC CTATGACTAC GACCTCTCTT 150 

TGATCGGGAT CGCCCCAGAT GCTATGGATT GGTACTTGTA TACAGGAGAA 2 00 

10 CTGGAAAACA TCCGACAAGA CACGTGGTTG TTGGATACGA AACATAAACA 2 50 

GAAAATACAG CCGCTATTCG AAGGAAACGG CTTTTGGCTA AGTGAAGAGC 300 

AGCAAACGTT GGTACCTGAT GTTTTATTTC CCATTATGCA TGGCAAATAC 3 50 

GGGGAAGATG GCAGTATCCA AGGATTGTTT GAATTGATGA AGCTGCCTTA 400 

TGTAGGC TGC GGGGTGGCAG GTTCTGCCTT ATGTATGAAC AAATGGCTGC 450 

15 TGCATCAAGC TGCAGCAGCC ATTGGCGTAC AAAGTGCTCC TACGATTCTC 500 

TTGACAAATC AAGCCAACCA GCAAGAACAA ATCGAAGCTT TTATCCAGAC 550 

CCATGGCTTC CCAGTTTTCT TTAAGCCTAA TGAAGCGGGC TCCTCAAAAG 600 

GGATCACTAA AGTCACCTGC GTTGAAGAAA TCGCTTCTGC CTTAAAAGAA 650 

GCCTTTACTT ATTGTTCCGC AGTGCTCCTA CAAAAAAATA TTGCCGGTGT 700 

2 0 TGAGATCGGT TGCGGTATTT TGGGCAACGA CTCTTTGACT GTCGGTGCTT 750 

GTGACGCCAT TTCATTAGTA GACGGCTTTT TCGATTTTGA AGAAAAGTAC 800 

CAGCTGATCA GCGCCAAAAT CACCGTCCCT GCGCCATTGC CTGAAACGAT 850 

TGAAACCAAG GTCAAAGAAC AAGCTCAGCT GCTCTATCGT AGTCTTGGTC 900 

TTAAAGGTCT TGCTCGCATC GACTTTTTTG TCACGGAGCG AGGAGAACTA 950 

25 TACTTGAATG AAATCAATAC TATGCCGGGC TTTACGAGTC ACTCCCGCTA 1000 

TCCTGCCATG ATGGCAGCGG TCGGCTTATC CTATCAAGAA CTACTACAAA 1050 

AACTGCTTGT CTTAGCAAAG GAGGAAGTCA AATGAG 1086 



30 2) INFORMATION FOR SEQ ID NO: 1141 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3946 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE : Genomi c DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: BM4147 

(C) ACCESSION NUMBER: extracted from M97297 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1141 

ATGAATAACA TCGGCATTAC TGTTTATGGA TGTGAGCAGG ATGAGGCAGA 50 

TGCATTCCAT GCTCTTTCGC CTCGCTTTGG CGTTATGGCA ACGATAATTA 100 

ACGCCAACGT GTCGGAATCC AACGCCAAAT CCGCGCCTTT CAATCAATGT 150 

50 ATCAGTGTGG GACATAAATC AG AG ATTTC C GCCTCTATTC TTCTTGCGCT 200 

GAAGAGAGCC GGTGTGAAAT ATATTTCTAC CCGAAGCATC GGCTGCAATC 250 

ATATAGATAC AACTGCTGCT AAGAGAATGG GCATCACTGT CGACAATGTG 300 

GCGTACTCGC CGGATAGCGT TGC CG ATT AT ACTATGATGC TAATTCTTAT 350 

GGCAGTACGC AACGTAAAAT CGATTGTGCG CTCTGTGGAA AAACATGATT 400 

55 TCAGGTTGGA CAGCGACCGT GGCAAGGTAC TCAGCGACAT GACAGTTGGT 450 

GTGGTGGGAA CGGGC C AG AT AGGCAAAGCG GTTATTGAGC GGCTGCGAGG 500 

ATTTGGATGT AAAGTGTTGG CTTATAGTCG CAGCCGAAGT ATAGAGGTAA 550 

AC TATGTACC GTTTGATGAG TTGC TGC AAA ATAGCGATAT CGTTACGCTT 600 

CATGTGCCGC TCAATACGGA TACGCACTAT ATTATCAGCC ACGAACAAAT 650 

60 ACAGAGAATG AAGCAAGGAG CATTTCTTAT CAATACTGGG CGCGGTCCAC 700 
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TTGTAGATAC CTATGAGTTG GTTAAAGCAT TAGAAAACGG GAAACTGGGC 7 50 

GGTGCCGCAT TGGATGTATT GGAAGGAGAG GAAGAGTTTT TCTACTCTGA 800 
TTGCACCCAA AAACCAATTG ATAATCAATT TTTACTTAAA CTTCAAAGAA 850 
TGCCTAACGT GATAATCACA CCGCATACGG C C TATTAT AC CGAGCAAGCG 900 
5 TTGCGTGATA CCGTTGAAAA AACCATTAAA AACTGTTTGG ATTTTGAAAG 950 

GAGACAGGAG CATGAATAGA AT AAAAG TTG CAATACTGTT TGGGGGTTGC 1000 

TCAGAGGAGC ATG AC GTATC GGTAAAATCT GCAATAGAGA TAGCCGCTAA 1050 

CATTAATAAA GAAAAATACG AGCCGTTATA CATTGGAATT ACGAAATCTG 1100 

GTGTATGGAA AATGTGCGAA AAACCTTGCG CGGAATGGGA AAACGACAAT 1150 

10 TGCTATTCAG CTGTACTCTC GCCGGATAAA AAAATGCACG GATTACTTGT 1200 

TAAAAAGAAC CATGAATATG AAATCAACCA TGTTGATGTA GCATTTTCAG 1250 

CTTTGCATGG CAAGTCAGGT GAAGATGGAT C C AT AC AAGG TCTGTTTGAA 1300 

TTGTCCGGTA TCCCTTTTGT AGGCTGCGAT ATTCAAAGCT CAGCAATTTG 1350 

TATGGACAAA TCGTTGACAT ACATCGTTGC GAAAAATGCT GGGATAGCTA 1400 

15 CTCCCGCCTT TTGGGTTATT AATAAAGATG ATAGGCCGGT GGCAGCTACG 1450 

TTTACCTATC CTGTTTTTGT TAAGCCGGCG CGTTCAGGCT CATCCTTCGG 1500 

TGTGAAAAAA GTCAATAGCG CGGACGAATT GGACTACGCA ATTGAATCGG 15 50 

CAAGACAATA TGACAGCAAA ATCTTAATTG AGCAGGCTGT TTCGGGCTGT 1600 

GAGGTCGGTT GTGCGGTATT GGGAAACAGT GCCGCGTTAG TTGTTGGCGA 1650 

20 GGTGGACCAA ATCAGGCTGC AGTACGGAAT CTTTCGTATT CATCAGGAAG 17 00 

TCGAGCCGGA AAAAGGCTCT GAAAACGCAG TTATAACCGT TCCCGCAGAC 17 50 

CTTTCAGCAG AGGAGCGAGG ACGGATACAG GAAACGGCAA AAAAAATATA 1800 

TAAAGCGCTC GGCTGTAGAG GTCTAGCCCG TGTGGATATG TTTTTACAAG 1850 

ATAACGGCCG CATTGTACTG AACGAAGTCA ATACTCTGCC CGGTTTCACG 1900 

25 TCATACAGTC GTTATCCCCG TATGATGGCC GCTGCAGGTA TTGCACTTCC 1950 

CGAACTGATT GACCGCTTGA TCGTATTAGC GTTAAAGGGG TGATAAGCAT 2 000 

GGAAATAGGA TTTACTTTTT TAGATGAAAT AGTACACGGT GTTCGTTGGG 2 050 

ACGCTAAATA TGCCACTTGG GATAATTTCA CCGGAAAACC GGTTGACGGT 2100 

TATGAAGTAA ATCGCATTGT AGGGACATAC GAGTTGGCTG AATCGCTTTT 2150 

30 GAAGGCAAAA GAACTGGCTG CTACCCAAGG GTACGGATTG CTTCTATGGG 22 00 

ACGGTTACCG TCCTAAGCGT GCTGTAAACT GTTTTATGCA ATGGGCTGCA 225 0 

CAGCCGGAAA ATAACCTGAC AAAGGAAAGT TATTATCCCA ATATTGACCG 2300 

AACTGAGATG ATTTCAAAAG GATACGTGGC TTCAAAATCA AGCCATAGCC 2 350 

GCGGCAGTGC CATTGATCTT ACGCTTTATC GATTAGACAC GGGTGAGCTT 2400 

3 5 GTACCAATGG GGAGCCGATT TGATTTTATG GATGAACGCT CTCATCATGC 2450 

GGCAAATGGA ATATCATGCA ATGAAGCGCA AAATCGCAGA CGTTTGCGCT 2 500 

CCATCATGGA AAACAGTGGG TTTGAAGCAT ATAGCCTCGA ATGGTGGCAC 2 550 

TATGTATTAA GAGACGAACC ATACCCCAAT AGCTATTTTG ATTTCCCCGT 2 600 

TAAATAAACT TTTAACCGTT GCACGGACAA AC T AT AT AAG CTAACTCTTT 2 650 

40 CGGCAGGAAA CCCGACGTAT GTAACTGGTT CTTAGGGAAT TTATATATAG 2700 

TAGATAGTAT TGAAGATGTA AGGCAGAGCG ATATTGCGGT CATTATCTGC 2750 

GTGCGCTGCG GCAAGATAGC CTGATAATAA GACTGATCGC ATAGAGGGGT 2 800 

GGTATTTCAC ACCGCCCATT GTCAACAGGC AGTTCAGCCT CGTTAAATTC 2 850 

AGCATGGGTA TCAC TTATG A AAATTCATCT ACATTGGTGA TAATAGTAAA 2900 

45 TCCAGTAGGG CGAAATAATT GACTGTAATT TACGGGGCAA AACGGCACAA 2950 

TCTCAAACGA GATTGTGCCG TTTAAGGGGA AGATTCTAGA AATATTTCAT 3000 

ACTTCCAACT ATATAGTTAA GGAGGAGACT GAAAATGAAG AAGTTGTTTT 3050 

TTTTATTGTT ATTGTTATTC TTAATATACT TAGGTTATGA CTACGTTAAT 3100 

GAAGCACTGT TTTCTCAGGA AAAAGTCGAA TTTCAAAATT ATGATCAAAA 3150 

50 TCCCAAAGAA CATTTAGAAA ATAGTGGGAC TTCTGAAAAT ACCCAAGAGA 3200 

AAACAATTAC AGAAGAACAG GTTTATCAAG GAAATCTGCT ATTAATCAAT 3250 

AGTAAATATC CTGTTCGCCA AGAAAGTGTG AAGTCAGATA TCGTGAATTT 3 3 00 

ATCTAAACAT GACGAATTAA TAAATGGATA CGGGTTGCTT GATAGTAATA 3350 

TTTATATGTC AAAAGAAATA GCACAAAAAT TTTCAGAGAT GGTCAATGAT 3400 

55 GCTGTAAAGG GTGGCGTTAG TCATTTTATT ATTAATAGTG GCTATCGAGA 3450 

CTTTGATGAG CAAAGTGTGC TTTACCAAGA AATGGGGGCT GAGTATGCCT 3 500 

TACCAGCAGG TTATAGTGAG CATAATTCAG GTTTATCACT AGATGTAGGA 3 550 

TCAAGCTTGA CGAAAATGGA ACGAGCCCCT GAAGGAAAGT GGATAGAAGA 3 600 

AAATGCTTGG AAATACGGGT TCATTTTACG TTATCCAGAG GACAAAACAG 3 650 

60 AGTTAACAGG AATTCAATAT GAACCATGGC ATATTCGCTA TGTTGGTTTA 37 00 
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CCACATAGTG CGATTATGAA AGAAAAGAAT 
GGATTACCTA AAAGAAGAAA AAACCATTTC 
AATATGAGAT CTTTTATTAT CCTGTTACTA 
CCGACTAATC TTCGTTATGA GATATCAGGA 
5 TGTGACAGTG TTTCCCGGAT CAACACATAC 



TTCGTTCTCG AGGAATATAT 3 7 50 

TGTTAGTGTA AATGGGGAAA 3800 

AAAAT AC C AC CATTCATGTG 3850 

AACAATATAG ACGGTGTAAT 3 900 

TAATTCAAGG AGGTAA 3 946 



10 



2 ) INFORMATION FOR SEQ ID NO: 1142 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1142 

20 5 . 
GATCCTCTAA ATGATTCTCA GGTGG " 



25 2) INFORMATION FOR SEQ ID NO: 1143 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

3 0 (C) _S TRANDEDNES S : Single 

. . (D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1143 

CAATTAGCTT AGCAATAGGT GTTGG 25 

40 

2 ) INFORMATION FOR SEQ ID NO: 1144 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



50 



55 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1144 
TGTYTTCCAA GGTTCAGCTC 20 

2 ) INFORMATION FOR SEQ ID NO: 1145 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1145 
10 AACATATTKG GTTGATAGGT 2 0 



15 



45 



55 



2 ) INFORMATION FOR SEQ ID NO: 1146 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1146 

25 

GGGATTACCT ATGCCAATATGAT 



3 0 2 LINFORMATION FOR SEQ ID NO : 1147 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

4 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1147 

AGCTGTGTTA GCVCGAACAT CTTG 24 



2 ) INFORMATION FOR SEQ ID NO: 1148 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1148 
GACTTTGTTT GGCGTGATAT 20 
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10 



20 



30 



2 ) INFORMATION FOR SEQ ID NO: 1149 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1149 

20 

15 TCCYACWATT TCTTTTTGWG 



2 ) INFORMATION FOR SEQ ID NO: 1150 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1150 



. 20 

TGATAATCAC ACCGCATACG 



35 2 ) INFORMATION FOR SEQ ID NO: 1151 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1151 

TGCTGTCATA TTGTCTTGCC 



50 „„„ 
2 ) INFORMATION FOR SEQ ID NO: 1152 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1152 
ATAAAGATGA TAGGCCGGTG 2 0 

5 

2 ) INFORMATION FOR SEQ ID NO: 1153 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1153 
2 0 CTCGTATGTC CCTACAATGC 20 



25 



2 ) INFORMATION FOR SEQ ID NO: 1154 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1154 

35 

GTTTGAAGCA TATAGCCTCG 20 



40 2 ) INFORMATION FOR SEQ ID NO: 1155 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1155 

CAGTGCTTCA TTAACGTAGT C 21 



55 

2) INFORMATION FOR SEQ ID NO: 1156 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1156 
GTTGAAATGC ATCACGAACA ATT 

10 

2) INFORMATION FOR SEQ ID NO: 1157 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

MOLECULE TYPE: DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 1157 
25 AAGAACGTTT CAGTTAAGGA AAT 



2 ) INFORMATION FOR SEQ ID NO: 1158 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1158 

40 

AAGAGGTAAT GTCTGTGGT 



15 



(i) 



20 



(ii) 
(xi) 



45 2 ) INFORMATION FOR SEQ ID NO: 1159 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1159 

TGAAGGTTTG CCAGGTGA 
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2 ) INFORMATION FOR SEQ ID NO : 1160 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1160 
CGTTTCTGTT AAAGAAATTA GAAG 24 

15 



2 ) INFORMATION FOR SEQ ID NO: 1161 

2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

25 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1161 

3 0 TCCAGGTGAT AACGTTGG 18 



35 



2 ) INFORMATION FOR SEQ ID NO: 1162 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1162 

45 

CAAGTCCGTG GAAATGCA 18 



5 0 2 ) INFORMATION FOR SEQ ID NO: 1163 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1163 

21 

GTTGGTTTCA ACGTTAAGAA C 

5 

2 ) INFORMATION FOR SEQ ID NO: 1164 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1164 
GGTTTCAACG TCAAGAAC 

20 

2 ) INFORMATION FOR SEQ ID NO: 1165 

25 ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1165 
GGTTTCAACG TGAAGAAC 18 



30 



35 



40 



2 ) INFORMATION FOR SEQ ID NO: 1166 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1166 
50 ~ 

ACGTTAAGAA TGTTTCTGTC AA ^ z 



55 2 ) INFORMATION FOR SEQ ID NO: 1167 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 bases 

(B) TYPE: Nucleic acid 



842 



CA 02307010 2000-05-19 



(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1167 
GTTGGTTTCA ACGT 14 

10 

2 ) INFORMATION FOR SEQ ID NO: 1168 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

2 0 (ii) MOLECULE TYPE; DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1168 
GAACAATTGG TTGAAGGTGT 2 0 

25 

2 ) INFORMATION FOR SEQ ID NO : 1169 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
40 (B) STRAIN: SP-665 

(C) ACCESSION NUMBER: AF139883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1169 

45 ATGAACAAAC CAACGATTCT GCGCCTAATC AAGTATCTGA GCATTAGCTT 50 

CTTAAGCTTG GTTATCGCAG CCATTGTCTT AGGCGGAGGA GTTTTTTTC T 100 

AC T AC G TT AG CAAGGCTCCT AGCCTATCCG AGAGTAAACT AGTTGCAACA 15 0 

AC TTC TAGT A AAATCTACGA CAATAAAAAT CAACTCATTG CTGACTTGGG 2 00 

TTCTGAACGC CGCGTCAATG CCCAAGCTAA TGATATTCCC ACAGATTTGG 250 

50 TTAAGGCAAT CGTTTCTATC GAAGACCATC GCTTCTTCGA CCACAGGGGG 3 00 

ATTGATTCCA TCCGTATCCT GGGAGCTTTC TTGCGCAATC TGCAAAGTAA 350 

TTCCCTCCAA GGTGGATCAA CTCTCACCCA ACAGTTGATT AAGTTGACTT 400 

ACTTTTCAAC CTCGACTTCC GACCAGACTA TTTCTCGTAA GGCTCAGGAA 450 

GCTTGGTTAG CGATTCAGTT AGAACAAAAA GCAACCAAAC AGGAAATCTT 500 

55 GACCTACTAT ATAAATAAGG TCTACATGTC TAATGGCAAC TATGGAATGC 550 

AGACAGCAGC TCAAAACTAC TATGGTAAAG ACCTCAATAA TTTAAGTTTA 600 

CCTCAGTTAG CCTTGCTGGC TGGAATGCCT CAGGCACCAA ACCAATATGA 65 0 

CCCCTATTCA CATCCAGAAG CAGCCCAAGA CCGCCGAAAC TTGGTCTTAT 700 

CTGAAATGAA AAATCAAGGT TACATCTCTG CTGAACAGTA TGAGAAAGCA 7 50 

60 GTCAATACAC CAATTACTGA TGGACTACAA AGTCTCAAAT CAGCAAGTAA 800 
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TTACCCTGCT TACATGGATA ATTACCTCAA GGAGGTCATC AATCAAGTAG 850 

AACAAGAAAC TGGCTATAAC CTTCTAACTA CTGGGATGGA TGTTTACACA 900 

amgtSacc aagaagctca AAAACATCTG TGGGATATCT ACAACTCCGA 950 

T^TACGTC TCTTACCCTG ACGATGATTT GCAAGTCGCA TCTACGGTCG 1000 

5 JagItcStTC AaSgGTAAA GTCATCGCCC AACTTGGAGC TCGTCACCAA 1050 

GCAAGTAACG TTTCATTTGG TACCAACCAA GCTGTGGAAA CCAATCGTGA 1100 

SgISgTOCT GCTATGAAAC CAATCACCGA TTATGCACCT GCCATAGAAT 1150 

aSg?gStI tgattccact gcaactatgg ttaatgatat TCCTTATAAC 1200 

?a?c2ggga£ SKgcacacc tgtctacaac tgggatagag catatttcgg 1250 

10 TAATATTACT CTGCAATATG CTCTTCAACA ATCACGAAAT GTCACAGCCG 1300 

?tcI5acttt GAATAAGGTC GGTCTAGATA GAGCTAAAAC CTTCCTTAAT 1350 

GGTCTTGGTA ?CGACTATCC AAGCATGCAT TATGCAAACG CCATTTCAAG 1400 

TAA.TACAACA GAATCTAATA AACAATACGG AGCAAGTAGT GAAAAAATGG 1450 

CTGCTGCTTA TGCTGCCTTT GCAAATGGTG GCACTTACTA TAAACCAATG 1500 

is SatcSSa AAGTCGTCTT CAGTGATGGA AGTAAAAAAG AGTTCTCTAA 1550 

TGTCGGAACT CGTGCCATGA AGGAAACGAC AGCCTATATG ATGACCGACA 1600 

TGATGAAAAC AGTCTTGACT TATGGAACTG GGCGTGGAGC CTATCTTCCT 1650 

^S??^? AAGCTGGTAA AACAGGAACC TCTAACTATA CAGATGAGGA 1700 

AGTTGAAAAC CACATCAAGA ACACTGGCTA TGTAGCTCCA GATGAAATGT 1750 

20 ^?G^A T ACTC GT AAG TATTCTATGG CTGTATGGAC AGGTTATTCG 1800 

AATCGTTTAA CTCCTATCGT TGGAGATGGT TTCCTAGTTG CAGCTAAAGT 1850 

TTATCGCTCA ATGATAACGT ATC TATC AG A AGATACTCAT CCAGAAGACT 1900 

GGACGATGCC AGACGGACTT TTCAGAAACG GGGAATTTGT ATTCAAAAAT 1950 

SgagStcgcc CAATATGGAC TGAACCCTCT ACTCAACAAT CCTCAACAGC 2000 

25 SSSSSS AGCTCATCAT CAGATAGTTC AACTTCACAG TCTAGCTCAA 2050 

25 CACAAATAAT AGTACGACTA CCAATCCTAA CAATAATACG 2100 

CAACAATCAA ATACAACCCC TGATCAACAA AATCAGAATC CTCAACCAGC 2150 
ACAACCATAA 



30 



2 ) INFORMATION FOR SEQ ID NO: 1170 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1170 
ACGAATTGGA CTACGCAATT 

45 



20 



2 ) INFORMATION FOR SEQ ID NO: 1171 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1171 

2 0 

60 ACGAGGATGA TTTGATTGTC 
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2 ) INFORMATION FOR SEQ ID NO: 1172 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1560 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 64147 

(C) ACCESSION NUMBER: X13136 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1172 



GATCCTCTAA ATGATTCTCA GGTGGCTGTT ATTGCCTCTA TTTCAAAGGA 50 

GATGCCTGGC ATTAGTATTT CTACTTCTTG GGATAGAAAG GTTTTGGAAA 100 

CTTCCCTTTC TTCTATAGTT GGGAGTGTAT CCAGTGAAAA AGCTGGTCTC 150 

CCAGCGGAAG AAGCAGAAGC CTATCTTAAA AAAGGC TATT CTCTAAATGA 200 

25 C CGTGTAGG A ACCTCCTATT TGGAAAAGCA ATATGAAGAG ACCTTACAAG 2 50 

GAAAACGCTC GGTAAAAGAA ATCCATCTGG ACAAATATGG CAACATGGAA 3 00 

AGCGTGGATA CAATTGAGGA AGGTAGTAAG GGAAACAATA TCAAGCTGAC 3 50 

CATTGATTTG GCCTTCCAAG ATAGCGTGGA TGCTTTGCTG AAAAGTTATT 4 00 

TCAATTCAGA GTTGGGAAAT GGTGGAGCCA AGTATTCTGA AGGTGTCTAT 4 50 

3 0. .GCAGTCGCCC TTAACCCAAA AACAGGTGCT GTTTTGTCTA TGTCAGGAAT 5 00 

TAAACATGAC TTGAAAACAG GAGAGTTGAC GCCGGATTCC TTGGGAACGG 5 50 

TAACCAATGT CTTTGTCCCA GGTTCGGTTG TCAAAGCAGC GACCATCAGC 600 

TCTGGTTGGG AAAATGGAGT CTTGTCAGGA AATCAGACCT TGACAGACCA 650 

GTCCATTGTC TTTCAAGGTT CAGCTCCAAT TAATTC TTGG TATCCTGCCT 7 00 

3 5 TTTC TAG AC C AATGC CG ATT ACGGCGGTTC AGGCTCTAGA GTATTCATCC 750 

AATGCTTATA TGGTC CAAAC AGCCCTAGGT CTTATGGGGC AG AC C TATC A 800 

ACCCAATATG TTTGTCGGCA CCAGCAATCT AGAGTCTGCT ATGGGGAAAT 850 

TGCGTTCAAC CTTTGGTGAA TATGGTTTGG GTTCTGCGAC CGGAATTGAC 900 

CTACCAGATG AATCTACTGG ATTTGTTCCC AAAGAGTATA GCTTTGCTAA 950 

4 0 TTTCATTACC AATGC CTTTG GGCAGTTTGA TAACTATACG CCGATGCAGT 1000 

TGGCTCAGTA TGTAGCAACT ATTGCAAATG ATGGTGTTCG TGTGGCTCCT 1050 

CGTATTGTTG AAGGCATTTA TGGTAATAAT GATAAGGGAG GACTGGGTGA 1100 

CTTGATTCAG CAACTGCAAC CGACAGAGAT GAATAAGGTC AATATATCCG 1150 

ACTCCGATAT GAGCATCTTG CACCAAGGTT TTTATCAGGT TGCCCATGGT 1200 

45 AC TAGTG AAT TGACAACTGG ACGTGCCTTT TCAAATGGCG CCTTGGTATC 12 50 

CATTAGCGGA AAAACAGGTA CAGCCGAAAG CTATGTGGCA GATGGTCAGC 13 00 

AAGCAACCAA TACCAATGCG GTGGCCTATG CCCCATCTGA TAATCCCCAA 1350 

ATCGC TGTAG CTGTTGTCTT CCCTCATAAC ACCAACCTTA CAAATGGTGT 1400 

CGGACCTTCC ATTGCGCGCG ATATTATCAA CCTCTATAAC CAACATCATC 1450 

50 CAATGAATTA GAAAGGAACA TATGCTTTAT CCAACACCTA TTGCTAAGCT 1500 

AATTGACAGT TATTCGAAGT TACCGGGTAT CGGGATTAAG ACGGCTACCC 1550 

GTTTGGCCTT 1560 



55 

2 ) INFORMATION FOR SEQ ID NO: 1173 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2007 bases 
60 (B) TYPE: Nucleic acid 

845 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: CS109 

(C) ACCESSION NUMBER: Z49096 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1173 



GAGTCCCGAT 
GATGAGAACT 

15 ACAATTTAAC 
AATC CTATGT 
TTTGGAGCAA 
AAAAGAGTTG 
CCAATCGTAG 

20 GCTCAGCTCC 
TGGAATGGAG 
TTACCTATGA 
GTATCGCAAC 
TCCGCTACAA 

2 5 TAAAAGGT AA 

ATTCTCGCTA 
AATCACTGAG 
AACCAGGATC 
AATACCTTCC 

3 0 GGATGCGACG 

GGATGATGAC 
AGTCTACTTG 
ACGCTTTAAA 
CTGGTCAACT 

35 GGGCAAGGAA 
TATTGC TAAT 
ATGATACTAA 
AATCCTGTTT 
AGTTGGGACG 

40 AGCCAATTAT 
GCTCAAATCG 
TTATATTTTC 
TCTTGTATGT 
GGAGAATTTG 

45 TCTCAATCTT 
CTTCTTATGC 
GAAGCCTTAC 
AAAGATTAAA 
AACAAGTTCT 

50 AGCTGGAAAA 
ACTGGAATTT 
CTAATACAGC 
TAATATG 



55 



60 



TGCTGAGGAT 
ATAAGTCAGC 
AAGGTTGCAG 
AAGAGAGCAA 
AGGGAAATGG 
GAAACTGCAG 
TTATCCAAAC 
ATGAAAATGA 
AGTTCCTTGA 
AAAGGATCGT 
AAACTGTGGA 
TCTTTCATGG 
GTATATGACC 
CCACCCAACG 
GACTTTGTTT 
AGCCTTTAAG 
CAAGTGGAGA 
ACTCGAGATT 
TTTCTCACAA 
AACAAAAAAT 
TTTGGGGTTC 
TCCAGCTGAT 
TTTCAGTGAC 
GATGGAGTTA 
CAATCAGTCT 
CCAAAGAGGC 
GACCCTCTAT 
AACAGTTCCT 
CTGATGAGAA 
TCAGTTGTGA 
AACGGTTCAA 
CCACCCCAAT 
CAATCTCCAG 
AATGCCTAGC 
GCCGAAATAT 
GAGACTTCTG 
CCTTTTATCG 
AAGAGAC TGC 
GAAGGTTCAG 
TATCAAAAAC 



GCAACCTCCT 
AACGGGTAAG 
AGGTCTTTCA 
CTCTCGCAAC 
GATTACCTAT 
AGGTCAAGGG 
GGACAATTTG 
AGATGGCAGC 
ACAGTATTCT 
CTGGGCAATA 
TGGCAAGGAT 
AAAC TC AG AT 
GCGACCTTGG 
ACCTACCTTT 
GGCGTGATAT 
GTCATGATGT 
ATACTTCAAT 
GGGATGTTAA 
GGTTTCGCTC 
GGGAGATGCT 
CAACTCGCTT 
AATATTGTTA 
AC AAAC AC AA 
TGCTGGAGCC 
GTACGTAAGT 
AGCAAGCACA 
ATGGAACTAT 
GGACAAAATG 
AAATGGAGGA 
CTATGAATCC 
CAGCCTGAGC 
CTTGGAGCGG 
CCAAAAATTT 
ATCAAGGATA 
TGTGCAACCA 
TAGAAGAAGG 
GATAAGGTAG 
CGAGACCTTT 
GTTCCGTTGT 
ATTAAAAAAA 



ATAATGTCTA 
ATTCTTTACG 
TAAGTATCTG 
CTAATCTCAA 
GCCAATATGA 
GATTGATTTT 
CTTCTAGTTT 
AAGAGCTTGC 
TGCAGGGACA 
TTGTACCCGG 
GTTTATACAA 
GGATGCCTTT 
TCAGTGCAAA 
AATGCAGATA 
TCTTTATCAA 
TAGCTTCTTC 
AGCAGTGAAT 
TGAGGGTTTG 
ACTCCAGTAA 
ACTTGGTTGG 
TGGCTTGACA 
GTATTGCTCA 
ATGCTTCGTG 
AAAATTTATA 
CACAAAAAGA 
ACTCGAAATC 
GTATAATCAC 
TAGCAGTTAA 
TACTTGGTTG 
TGCTGAAAAT 
ATTATTCAGG 
GCTTCAGCTA 
AGATAAAGTT 
TTTCACCTGG 
ATCGTTGTAG 
GACCAATCTT 
AAGAAATTCC 
GCTAAATGGT 
TCAGAAGCAA 
TTAAATTAAC 



TGCGGTCATT 5 0 

TAGAAAAAAC 100 

GACATGGAAG 150 

GCAAGTTTCC 2 00 

TGACTATCAA 250 

ACAACCAGTC 3 00 

T ATCGGTC T A 3 50 

TGGGAACTTC 400 

GACGGCATTA 450 

AACAGAACTG 500 

CATTGTCTAG 550 

CTAGAAAAAG 600 

G AC CGGTG AA 650 

CTAAAGAAGG 7 00 

AGTAACTATG 750 

TATTGATAAT 800 

TCAAAATAGC 850 

ACTACTGGTG 900 

TGTTGGAACG 950 

ATT ATC T AAA 1000 

GATGAATACG 1050 

AAGCTC ATTT 1100 

CCTTTAC AGC 1150 

AGTGCTATTT 1200 

AATAGTAGGA 1250 

ACATGATCTT 13 00 

TACACAGGAA 1350 

ATCCGGTACG 1400 

GTTCTACCAA 1450 

CCTGATTTTA 1500 

T ATC CAGTTG 1550 

TGAAAGAATC 1600 

ACGACAGAAT 1650 

TGAGTTGGCG 17 00 

GTACTGGAAC 1750 

GCACCAAACC 1800 

AGACATGTAT 185 0 

TGGATATTGA 1900 

GATGTTCGGA 1950 

TTTAGGAGAC 2000 
2007 



2) INFORMATION FOR SEQ ID NO: 1174 



(i) SEQUENCE CHARACTERISTICS; 
(A) LENGTH: 18 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1174 
GAACGTGGTG AAGTTCGC 18 

10 



2) INFORMATION FOR SEQ ID NO: 1175 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1175 
2 5 GTTACTGGTG TAGAAATGTT C 21 



30 



2) INFORMATION FOR SEQ ID NO: 1176 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1176 

40 

TACTGGTGTA GAAATGTTC 19 



45 2 ) INFORMATION FOR SEQ ID NO: 1177 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1177 

GCTCAACAAG TTCCAGATTA 20 



60 



847 



CA 02307010 2000-05-19 



2 ) INFORMATION FOR SEQ ID NO: 1178 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2456 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



10 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: NCTC8325 

(C) ACCESSION NUMBER: X52593 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117 8 



ATGAACTGAT TATACTTAAC ATTAAAAAAG ATGATAACAC CTTCTACACC 50 

TCCATATCAC AAAAAATTAT AACATTATTT TGACATAAAT AC T AC ATTTG 100 

20 TAATATACTA CAAATGTAGT CTTATATAAG GAGGATATTG ATGAAAAAGA 150 

TAAAAATTGT TCCACTTATT TTAATAGTTG TAGTTGTCGG GTTTGGTATA 200 

TATTTTTATG CTTCAAAAGA TAAAGAAATT AATAATACTA TTGATGCAAT 250 

TGAAGATAAA AATTTCAAAC AAGTTTATAA AGATAGCAGT TATATTTCTA 300 

AAAGCGATAA TGGTGAAGTA GAAATGACTG AACGTCCGAT AAAAATATAT 350 

25 AAT AG TTT AG GCGTTAAAGA TATAAACATT CAGGATCGTA AAATAAAAAA 400 

AGTATC TAAA AATAAAAAAC GAGTAGATGC TCAATATAAA ATTAAAACAA 450 

ACTACGGTAA CATTGATCGC AACGTTCAAT TTAATTTTGT TAAAGAAGAT 500 

GGTATGTGGA AGTTAGATTG GGATCATAGC GTCATTATTC CAGGAATGCA 550 

GAAAGACCAA AGCATACATA TTGAAAATTT AAAATCAGAA CGTGGTAAAA 600 

3 0 TTTTAGACCG AAACAATGTG GAATTGGCCA ATACAGGAAC ACATATGAGA 650 

TTAGGCATCG TTCCAAAGAA TGTATCTAAA - AAAGATTATA AAGCAATCGC 700 

TAAAGAACTA AGTATTTCTG AAG AC TAT AT CAACAACAAA TGGATCAAAA 7 50 

TTGGGTACAA GATGATACCT TCGTTCCACT TTAAAACCGT TAAAAAAATG 8 00 

GATGAATATT TAAGTGATTT CGCAAAAAAA TTTCATCTTA CAACTAATGA 850 

3 5 AACAGAAAGT CGTAACTATC CTCTAGAAAA AGCGACTTCA CATCTATTAG 900 

GTTATGTTGG TCCCATTAAC TC TG AAG AAT TAAAACAAAA AGAATATAAA 95 0 

GGC TATAAAG ATGATGCAGT TATTGGTAAA AAGGGACTCG AAAAACTTTA 1000 

CGATAAAAAG CTCCAACATG AAG ATGGC T A TCGTGTCACA ATCGTTGACG 1050 

ATAATAGCAA TACAATCGCA C AT AC ATT AA TAGAGAAAAA GAAAAAAGAT 1100 

40 GGCAAAGATA TTCAACTAAC TATTGATGCT AAAGTTCAAA AGAGTATTTA 1150 

TAACAACATG AAAAATGATT ATGGCTCAGG TACTGC TATC CACCCTCAAA 12 00 

CAGGTGAATT ATTAGCACTT GTAAGCACAC CTTCATATGA CGTCTATCCA 12 50 

TTTATGTATG GCATGAGTAA CGAAGAATAT AATAAATTAA CCGAAGATAA 13 00 

AAAAGAACCT CTGCTCAACA AGTTCCAGAT TACAACTTCA CCAGGTTCAA 1350 

45 CTCAAAAAAT ATTAACAGCA ATGATTGGGT TAAATAACAA AACATTAGAC 1400 

GATAAAACAA GTTATAAAAT CGATGGTAAA GGTTGGCAAA AAGATAAATC 1450 

TTGGGGTGGT TACAACGTTA CAAGATATGA AGTGGTAAAT GGTAATATCG 1500 

ACTTAAAACA AGCAATAGAA TCATCAGATA AC ATTTTC TT TGCTAGAGTA 1550 

GCACTCGAAT TAGGCAGTAA GAAATTTGAA AAAGGCATGA AAAAACTAGG 1600 

50 TGTTGGTGAA GATATACCAA GTGATTATCC ATTTTATAAT GCTCAAATTT 1650 

CAAACAAAAA TTTAGATAAT GAAATATTAT TAGCTGATTC AGGTTACGGA 17 00 

CAAGGTGAAA TACTGATTAA CCCAGTACAG ATCCTTTCAA TCTATAGCGC 1750 

ATTAGAAAAT AATGGCAATA TTAACGCACC TCACTTATTA AAAGACACGA 1800 

AAAACAAAGT TTGGAAGAAA AATATTATTT CCAAAGAAAA TATCAATCTA 1850 

55 TTAAATGATG GTATGCAACA AGTCGTAAAT AAAACACATA AAGAAGATAT 1900 

TTATAGATCT TATGCAAACT TAATTGGCAA ATCCGGTACT GCAGAACTCA 1950 

AAATGAAACA AGGAGAAAGT GGCAGACAAA TTGGGTGGTT TATATCATAT 2000 

GATAAAGATA ATCCAAACAT GATG ATGGC T ATTAATGTTA AAGATGTACA 2050 

AGATAAAGGA ATGGCTAGCT ACAATGCCAA AATCTCAGGT AAAGTGTATG 2100 

60 ATGAGCTATA TGAGAACGGT AATAAAAAAT ACGATATAGA TGAATAACAA 2150 
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AACAGTGAAG CAATCCGTAA CGATGGTTGC 
TATTAATAAG TGCTGTTACT TCTCCCTTAA 
TTGTATGTTG AAAGTGACAC TGTAACGAGT 
TTTCTTATTT GTAATTTCAG CGATAACGTA 
5 AGTTTAATAA ATTTAACGTT ATTCATTTGT 
TCCGTATTTA CCTTCTTCTA CCCATAATTT 
GCATGC 



TTCACTGTTT TATTATGAAT 2 200 

ATACAATTTC TTCATTTTCA 22 50 

CCATTTTCTT TTTTTATGGA 2 3 00 

CAATGTATTA C CTGGT AT AC 2350 

GTTCCTGCTA CAACTTCTTC 2400 

AAATGATATT GAAAGTGTAT 2450 

2456 



10 



2 ) INFORMATION FOR SEQ ID NO: 1179 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



20 



25 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1179 
ATTTGGTGAC GGGTGACTTT 20 

2 ) INFORMATION FOR SEQ ID NO : 1180 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1180 
TCCACCGTTG CCAATCGCA 19 

40 

2 ) INFORMATION FOR SEQ ID NO: 1181 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1181 
55 AGCAGCTTAC TAGATGCCGT 20 



2 ) INFORMATION FOR SEQ ID NO: 1182 

60 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1182 
AACTGCAAGA GATCCTTTGG ^ U 



15 2) INFORMATION FOR SEQ ID NO: 1183 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2535 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 175 

(C) ACCESSION NUMBER: M18729 



(i) 

20 



30 



"(xiT)" SEQUENCE DESCRIPTION :" SEQ ID'NO: 1183 



ATGGC GAT AG 
AAAGCAATAT 
3 5 AATTATTTTA 
TTAACGAGTC 
TCCCTATCAT 
ATAAGGTGGC 
GTTGTTAAAC 

40 TAGCAGTAAG 
AAGGCAATCA 
TATGTGACAG 
CCTCAAGGCT 
AACAAATCCT 

45 AGCTTTGAAG 
AACGGCATCT 
AATTGAACCA 
CAGATGGATT 
CTCAGGTAAG 

50 CGGCTATGGG 
GATAAGGAAC 
CCATTTCTTT 
ACATTGAGCG 
GATCTCTTGC 

55 GATTTTAGAA 
TGGATGCAAT 
GAAGCTCCTC 
TG AG AC TTT A 
TTGCTGAGAT 

60 AAGATTGACT 



AAAAGCTATC 

CCAGATGCTT 

TGAGGATGCG 

GCAACAAGAA 

TCTGCCCAAC 

TATCGCAGAG 

GAGAGGTTGT 

CCGGACAGTC 

ATTTGGCCTA 

GTCTTTTGGA 

CGAGAAGTGG 

CAGCCGCCAG 

ACCTTCATTT 

AGTAAGCTGC 

CCTCAAACCT 

ATGCGACCAA 

AAACAAGGCA 

GATGCGTCTC 

GAATCGTCCA 

GAGCGTAGTG 

CTTGGCTAGT 

AGTTGGCGAC 

GGGATGGAGC 

CCCTGAGTTG 

ATGTGATTAC 

GACAAGTATC 

TGAGGCTAAG 

ACAATAAAAA 



ACCCGGCATG 

TTTTGCTCTT 

GTCAATGCTG 

TGCCGACAAT 

AGTATATCGA 

CAGATGGAAG 

TCAGGTCATT 

AGAATAATTT 

GCTTATATGG 

TTTCACGCTG 

TGTTGGGTTA 

ATGAATC TGG 

ATTGGATTTG 

TCCAGTATGT 

GTTATCCGCT 

GGCTAGTCTG 

GTCTTTTCTG 

TTGCGTTCTT 

ACGTCAAGAA 

ACTTGACAGA 

CGTGTTTCTT 

TACCTTGTCT 

AACCTACTCT 

GAGAGTTTGA 

AGATGGGGGA 

GTTGCGTTCT 

GAGCGAGAAA 

GG ATGGC T AC 



CAACAGTATG 

TCGGATGGGT 

CGCAGATTCT 

CCGATCCCTA 

TGTCTTGATT 

ATCCTAAACA 

ACGCCAGGGA 

TTTGGTTTCC 

ATTTGGTGAC 

GTTTGTGGGG 

TGACTTGTCT 

TACTCTCTTA 

CGATTGGCAA 

TCATCGGACT 

ACGAAATTAA 

GATTTGGTTG 

GC TTTTGG AT 

GGATTCATCG 

GTAGTGCAGG 

CAGTCTCAAG 

TTGGCAAAAC 

AGTGTGCCAC 

AGCCTATCTC 

TTAGCGCAGC 

ATTATCCGGA 

CAGAGAAGGG 

ACTCTGGTAT 

TATTTTCATG 



TGGATATTAA 50 

GATTTTTATG 100 

GGAAATTTCC 15 0 

TGGCGGGTGT 200 

GAGCAGGGTT 25 0 

AGCAGTTGGG 300 

CAGTGGTCGA 350 

ATAGACCGCG 400 

GGGTGACTTT 450 

AAATCCGTAA 500 

GAGGAAGAAG 55 0 

TGAAAAAGAA 600 

CGGTGGAGCA 650 

CAGATGAGGG 7 00 

GG ATTTCTTG 7 50 

AGAATGCTCG 800 

GAAACCAAAA 850 

CCCCTTGATT 900 

TCTTTCTCGA 950 

GGTGTTTATG 1000 

C AATCC AAAG 105 0 

GGATTC GTGC 1100 

ATCGCACAAC 1150 

GATTGCTCCT 12 00 

CTGGATTTGA 1250 

ACTAGCTGGA 1300 

C AGCACGCTC 13 50 

TG AC C AATTC 1400 
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GCAACTGGGA AATGTGCCAG CCCACTTTTT CCGCAAGGCG ACGCTGAAAA 1450 

ACTCAGAACG CTTTGGAACC GAAGAATTAG CCCGTATCGA GGGAGATATG 15 00 

CTTGAGGCGC GTGAGAAGTC AGCCAACCTC G AAT AC G AAA TATTTATGCG 1550 

CATTCGTGAA GAGGTCGGCA AGTACATCCA GCGTTTACAA GCTCTAGCCC 1600 

5 AAGGAATTGC GACGGTTGAT GTCTTACAGA GTCTGGCGGT TGTGGCTGAA 1650 

ACCCAGCATT TGATTCGACC TGAGTTTGGT GACGATTCAC AAATTGATAT 17 00 

CCGGAAAGGG CGCCATGCTG TCGTTGAAAA GGTTATGGGG GCTCAGACCT 17 50 

ATATTCCAAA TACGATTCAG ATGGC AG AAG ATACCAGTAT TCAATTGGTT 1800 

ACAGGGCCAA ACATGAGTGG GAAGTCTACC TATATGCGTC AGTTAGCCAT 1850 

10 GACGGCGGTT ATGGC CCAGC TGGGTTCCTA TGTTCCTGCT GAAAGCGCCC 1900 

ATTTACCGAT TTTTGATGCG ATTTTTACCC GTATCGGAGC AGCAGATGAC 195 0 

TTGGTTTCGG GTCAGTCAAC CTTTATGGTG GAGATGATGG AGGCCAATAA 2000 

TGCCATTTCG CATGCGACCA AGAACTCTCT CATTCTCTTT GATGAATTGG 2 050 

GACGTGGAAC TGCAACTTAT GACGGGATGG CTCTTGCTCA GTCCATCATC 2100 

15 GAATATATCC ATGAGCACAT CGGAGCTAAG ACCCTCTTTG CGACCCACTA 2150 

CCATGAGTTG ACTAGTCTGG AGTCTAGTTT ACAACACTTG GTCAATGTCC 22 00 

ACGTGGCAAC TTTGGAGCAG GATGGGCAGG TCACCTTCCT TCACAAGATT 225 0 

GAACCGGGAC CAGCTGATAA ATCCTACGGT ATCCATGTTG CCAAGATTGC 2300 

TGGCTTGCCA GCAGACCTTT TAGCAAGGGC GGATAAGATT TTGACTCAGC 23 50 

2 0 TAGAGAATCA AGGAACAGAG AGTCCTCCTC CCATGAGACA AACTAGTGCT 2400 

GTCACTGAAC AGATTTCACT CTTTGATAGG GCAGAAGAGC ATCCTATCCT 245 0 

AGCAGAATTA GCTAAACTGG ATGTGTATAA TATGACACCT ATGCAGGTTA 2500 

TGAATGTCTT AGTAGAGTTA AAACAGAAAC TATAA 253 5 



25 

2 ) INFORMATION FOR SEQ ID NO : 1184 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 623 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

35 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 



40 (xiii) SEQUENCE DESCRIPTION: SEQ ID NO: 1184 

TGACGGGTGA CTTTTATGTG ACAGGTCTTT TGGATTTCAC GCTGGTTTGT 50 

GGGG AAATCC GTAACCTCAA GGCTCGAGAA GTGGTGTTGG GTTATGACTT 100 

GTCTGAGGAA GAAGAACAAA TCCTCAGCCG CCAGATGAAT CTGGTACTCT 150 

45 CTTATGAAAA AGAAAGCTTT GAAGACCTTC ATTTATTGGA TTTGCGATTG 2 00 

GCAACGGTGG AGCAAACGGC ATCTAGTAAG CTGCTCCAGT ATGTTCATCG 2 50 

GACTCAGATG AGGGAATTGA ACCACCTCAA ACCTGTTATC CG ATAC G AAA 3 00 

TTAAGGATTT CTTGCAGATG GATTATGCGA CCAAGGCTAG TCTGGATTTG 3 50 

GTTGAGAATG CTCGCTCAGG TAAGAAACAA GGCAGTCTTT TCTGGCTTTT 400 

5 0 GGATGAAACC AAAACGGCTA TGGGGATGCG TCTCTTGCGT TCTTGGATTC 450 

ATCGCCCCTT GATTGATAAG GAACGAATCG TCCAACGTCA AGAAGTAGTG 500 

CAGGTCTTTC TCGACCATTT CTTTGAGCGT AGTG AC TTGA CAGACAGTCT 55 0 

CAAGGGTGTT TATGACATTG AGCGCTTGGC TAGTCGTGTT TCTTTTGGCA 600 

AAACCAATCC AAAGGATCTC TTG 623 

55 



2 ) INFORMATION FOR SEQ ID NO: 1185 

60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 621 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
10 (B) STRAIN: StrR-06 

(xiv) SEQUENCE DESCRIPTION: SEQ ID NO: 1185 

^.^^p, rTTTTATGTG ACAGGTCTTT TGGATTTCAC GCTGGTTTGT 
nc SSSSSw gtaacctcaa GGCTCGAGAA GTGGTGTTGG GTTATGACTT 
15 gaa^aacaaa" TCCTCAGCCG ccagatgaat ctggtactct 

SStoSS agaaagcttt gaagaccttc atttattgga tttgcgattg 

STa?Sgtgg agcaaacggc atctagtaag ctgctccagt atgttcatcg 
agggISSga accacctcaa acctgttatc cgatacgaaa 
™ Sm*;^*™ ottgcagatg gattatgcga ccaaggctag tctggatttg 
20 g?toagS?g SSSKS taagaaacaa ggcagtcttt tctggctttt 

SatgaaIcc aaaacggcta tggggatgcg tctcttgcgt tcttggattc 

?TrScCCTT GATTGATAAG GAACGAATCG TCCAACGTCA AGAAGTAGTG 
oTnSSnSr TCGACCATTT CTTTGAGCGT AGTGACTTGA C AG AC AGTC T 
25 SSSSSE ?a?gacaSg AGCGCTTGGC TAGTCGTGTT TCTTTTGGCA 



30 2 ) INFORMATION FOR SEQ ID NO: 1186 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 bases 

(B) TYPE: Nucleic acid 
, 5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

(xv) SEQUENCE DESCRIPTION: SEQ ID NO: 1186 

45 TGACGGGTGA CTTTTATGTG ACAGGTCTTT TGGATTTCAC GCTGGTTTGT 50 
G^AAATCC GTAACCTCAA GGCTCGAGAA GTGGTGTTGG GTTATGACTT 100 
GTOTGA^GAA GAAGAACAAA TCCTCAGCCG CCAGATGAAT CTGGTACTCT 150 
SStcAAAA AGAAAGCTTT GAAGACCTTC ATTTATTGGA TTTGCGATTG 200 

50 GCAACGGTGG AGCAAACGGC ATCTAGTAAG CTGCTCCAGT ATGTTCATCG 250 
GACTCAGATG AGGGAATTGA ACCACCTCAA ACCTGTTATC CGCTACGAAA 
TTAAGGATTT CTTGCAGATG GATTATGCGA CCAAGGCTAG TCTGGATTTG 
GTTGAGAATG CTCGCTCAGG TAAGAAACAA GGCAGTCTTT TCTGGCTTTT 
SaTGAAACC AAAACGGCTA TGGGGATGCG TCTCTTGCGT TCTTGGATTC 

« S??ScCCCTT GATTGATAAG GAACGAATCG TCCAACGTCA AGAAGTAGTG 
cIgGTCTTTC TCGACCATTT CTTTGAGCGT AGTGACTTGA CAGACAGTCT 
CAaSSgTGTT TATGACATTG AGCGCTTGGC TAGTCGTGTT TCTTTTGGCA 
AAACCAATCC AAAGGATCTC TT 

60 

852 
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2 ) INFORMATION FOR SEQ ID NO: 1187 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 622 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-55 



15 



(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1187 



TGACGGGTGA CTTTTATGTG ACAGGTCTTT TGGATTTCAC GCTGGTTTGT 50 

GGGGAAATCC GTAACCTCAA GGCTCGAGAA GTGGTGTTGG GTTATGACTT 100 

2 0 GTCTGAGGAA GAAGAACAAA TCCTCAGCCG CCAGATGAAT CTGGTACTCT 150 

CTTATGAAAA AGAAAGCTTT GAAGACCTTC ATTTATTGGA TTTGCGATTG 2 00 

GCAACGGTGG AGCAAACGGC ATCTAGTAAG CTGCTCCGGT ATGTTCATCG 2 50 

GACTCAGATG AGGGAATTGA ACCACCTCAA AC C TGTTATC CGC TAC G AAA 3 00 

TTAAGGATTT CTTGCAGATG GATTATGCGA CCAAGGCTAG TCTGGATTTG 350 

25 GTTGAGAATG CTCGCTCAGG TAAGAAACAA GGCAGTCTTT TCTGGC TTTT 400 

GGATGAAACC AAAACGGCTA TGGGGATGCG TCTCTTGCGT TCTTGGATTC 45 0 

ATCGCCCCTT GATTGATAAG GAACGAATCG TCCAACGTCA AGAAGTAGTG 5 00 

CAGGTCTTTC TCGACCATTT CTTTGAGCGT AGTGAC TTG A CAGACAGTCT 550 

CAAGGGTGTT TATGACATTG AGCGCTTGGC TAGTCGTGTT TCTTTTGGCA 600 

3 0 AAACCAATCC AAAGGATCTC TT 622 



35 



40 



2) INFORMATION FOR SEQ ID NO: 1188 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 599 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 



(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1188 



50 GGGTGACTTT 
AAATTCGCAA 
GAGGAAGAAG 
TGAGAAGGAA 
CTGTGGAGCA 

55 CAGATGCGGG 
AGATTTCTTA 
AGAATGCCCG 
GAAACCAAGA 
TCCTTTGATT 

6 0 TCTTTCTTGA 



TATGTAACGG 
TCTCAAGGCT 
AACAAATCCT 
GGCTTTGAGG 
AGCGGCAGCT 
AATTGAACCA 
C AG ATGG AC T 
TTCAGGCAAG 
CGGCTATGGG 
GATAAGGAGC 
CCACTTCTTT 



GGCTATTGGA 
AGAGAAGTGG 
CAGTCGTCAG 
AC CTTC ATTT 
AGTAAGCTCC 
CCTCAAACCA 
ATGCGACCAA 
AAGCAAGGCA 
AATGCGTCTC 
GAATCGTCCA 
GAGCGTAGTG 



TTTCACGTTG 
TGCTGGGTTA 
ATGAATCTGG 
ACTGGATCCA 
TCCAGTATGT 
GTTATCCGCT 
GGCTAGTCTG 
GTCTTTTCTG 
TTGCGTTCTT 
GCGTCAAGAG 
ATTTAACGGA 



GTTTGTGGGG 50 

TGACTTGTCT 100 

TGCTTTCTTA 150 

CGACTGGCAG 2 00 

TCACCGGACC 250 

ATGAAATCAA 3 00 

GATTTGGTTG 3 50 

GCTTTTAGAT 400 

GGATTCATCG 450 

GTGGTGCAGG 500 

CAGTCTTAAG 550 
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GGTGTTTATG ATATCGAACG CTTGGCTAGT CGGGTTTCTT TTGGCAAGA 599 



5 2 ) INFORMATION FOR SEQ ID NO: 1189 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 624 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: _ . 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 49456 

(x) SEQUENCE DESCRIPTION: SEQ ID NO: 1189 

20 GGTGACGGGT GACTTTTATG TGACAGGTCT TTTGGATTTT ACGCTGGTTT 50 

G^G^GAAAT CCGCAATCTC AAGGCTCGAG AAGTGGTGCT GGGTTATGAC 100 

TTGTCTGAGG AAGAAGAACA GATCCTTAGT CGTCAGATGA ATCTGGTACT 150 

TTCCTATGAA AAAGAAGGCT TTGAAGACCT TCATTTACTG GATTCACGAT 200 

„ TGGCAGCTGT GGAGCAAGCG GCATCTAGTA AACTGCTTCA GTATGTTCAT 250 

C^ACTCAGA TGAGGGAATT GAACCACCTC AAGCCTGTTA TCCGCTATGA 300 

AA?CAAAGAT TTTTTGCAGA TGGATTATGC GACCAAGGCT AGTCTGGATT 350 

TGGTTGAGAA TGCCCGTTCA GGCAAGAAGC AAGGTAGTCT TTTTTGGC TT 400 

TTGGATGAAA CCAAAACAGC TATGGGAATG CGTCTCTTGC GGTCTTGGAT 450 

30 TCATCGCCCC CTGATTGATA AGGAACGAAT TGTCCAACGC CAAGAAGTTG . 500 

TGCAGGTCTT TCTCGACCAT TTCTTTGAGC GTAGTGATTT GACAGACAGT 550 

CTCAAGGGTG TTTATGACAT TGAGCGCTTG GCTAGTCGTG TTTCTTTTGG 600 
CAAAACCAAT CCAAAGGATC TCTT 



35 

2 ) INFORMATION FOR SEQ ID NO: 1190 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 599 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: , 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: LSPQ 2583 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1190 



TGACGGGTGA CTTTCAGGTG AC T AGTTT AG AGGACTTTGT CTTGGTCTGC 50 

GGGGAAATCC GCAATTTGAA AGCTAGGGAA GTGGTGCTGG GCTATGCCTT 100 

55 GCCAGAAGCT GAGGAGCAGG TTTTGGCTGG ACAGATGAAC CTTTTACTGT 150 

CCTATGTGGA GAAGGTTTTG GAGGATGTTC AGC TGCTGGG CGAGGAGCTG 200 

TCTC CTATGG AGCGTCAGGC AGCAGGGAAA CTGCTGGAGT ATGTGCACCG 250 

GACCCAGATG AGGGAGCTCA GCCATTTGAA GAAGGCTCAG CATTATGAAA 300 

TCAAGGACTT CCTGCAAATG GACTATGCCA CCAAGGCGAG TCTGGATTTG 3 50 

60 ACAGAAAATG CTCGCTCGGG CAAGAAGCAC GGCAGTCTTT ATTGGCTGAT 400 

854 
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GGACGAGACT AAGACGGCCA TGGGCGGCCG 
AGCGTCCGCT GATTGATGAA GCGCGAATTA 
GAGGTTTTTC TGGATCATTT CTTTGAGCGG 
CAAGGGGGTC TATGATATCG AGCGGCTGGC 



CATGCTGCGC TCTTGGATCC 450 

GCCAGCGACA GAATGTCGTT 500 

AGTGATTTGA CGGAGAGCCT 5 50 

TAGTCGGGTG TCTTTTGGC 599 



2 ) INFORMATION FOR SEQ ID NO: 1191 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mltis 
20 (B) STRAIN: ATCC 903 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1191 

TGACGGGTGA CTTTCAGGTG ACTAGTTTAG AGGACTTTGC CCTGGTCTGC 50 

25 GGGGAAATCC GTAATTTGAA GGCTAGGGAA GTGGTGCTGG GCTATGC TTT 100 

GCCAGAAGCT GAGGAGCAGG TCTTGGCTGG ACAGATGAAT CTTTTGCTGT 150 

C CTATGTAC A GACGGCCTTG GACGATGTCC AGCTGCTGGG CGAGGAACTG 2 00 

TCTCCTATGG AGCGTCAGGC AGCGGGGAAA TTGC TAG AGT ATGTGCACCG 25 0 

GACCCAGATG AGGGAGCTCA GCCATTTGAA GAAGGCCCAG CATTATGAAA 300 

30 TCAAGGACTT TCTGCAAATG GATTATGCTA C C AAGGC GAG TCTGGATTTG 350 

ACAGAAAATG CTCGCTCGGG TAAGAAACAC GGCAGTCTTT ATTGGCTGAT 400 

GG AC GAG AC C AAGACGGCCA TGGGCGGCCG TATGCTGCGC TCTTGGATCC 450 

AGCGTCCGTT GATTGATGAA GTGCGAATTA GCCAGCGGCA GAATGTCGTC 500 

GAGGTTTTTC TGGAACATTT CTTTGAGCGG AGTGATTTGA CGGAGAGCCT 550 

35 CAAGGGAGTC TATGATATCG AGCGGCTGGC TAGTCGGGTG TCTTTTGGC A 60 0 

AGACCAATCC AAAGGATCTC TT 622 



40 2 ) INFORMATION FOR SEQ ID NO: 1192 

( i ) SEQUENCE CHARAC TERI STICS : 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1192 

GGTAAAACAG GAACCTCTAA CT 22 



55 

2 ) INFORMATION FOR SEQ ID NO: 1193 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
60 (B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1193 
GGTAAGACAG GTACTTCTAA CT 



2) INFORMATION FOR SEQ ID NO: 1194 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C ) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1194 



CATTTCAAGT AATACAACAG AATC 



2) INFORMATION FOR SEQ ID NO: 1195 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1195 
CATTTCAAGT AACACAACTG AATC 



2 ) INFORMATION FOR SEQ ID NO: 1196 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1196 
GCCATTTCAA GTAATACAAC AGAA 



856 
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10 



15 



30 



45 



55 



2 ) INFORMATION FOR SEQ ID NO: 1197 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1197 
CAAACGCCAT TTCAAGTAAT ACAAC 25 



2 ) INFORMATION FOR SEQ ID NO: 1198 



(i) SEQUENCE CHARACTERISTICS: 
2 0 (A) LENGTH: 381 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Staphylococcus saprophyticus 

(B) STRAIN: ATCC 43867 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1198 



AACGGGCGTC TCGATAGAAA AACACGTGAA AATCCCAATG ATTATAAACA 50 

ATCAATATAC GATTTTGCTG AAGCTGTAAC AAAAGGTATT AAGGAACAAA 100 

3 5 CAAATAAAAA TTAATAGGCA ACTTAACCAG AATCGTTAAA AC TATATG AC 15 0 

GATTCTGGTT TTTTAAATTC AAAAAGTTTT CTAAAAAATT TACTTGCTTC 2 00 

TTTAAAGTAT AGGTATGAAA TACAATTGAT TAAAATAGTA AAGGAAATGA 250 

ATCATGAAAC AATTAACTAA GCCTTTATAC TTTTACCTAT TACTTTTTAT 300 

TACAACAACG CTGATTGGCG CGTTACTATT AT ATTTGC C A ATCACAKGTA 350 

40 AACATCCTAT TGATTTTGTG GACGCCCGTT A 3 81 



2) INFORMATION FOR SEQ ID NO: 1199 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1199 
GTATTAAAGA AGATATCCAA AAAGC 2 5 



60 2 ) INFORMATION FOR SEQ ID NO: 1200 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1200 
TCAAAGAAGA AACTAAAAAA GCTGT 



2 ) INFORMATION FOR SEQ ID NO: 1201 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1201 
AAC GT AGGTG TCCTTCTTC 



2 ) INFORMATION FOR SEQ ID NO: 1202 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1202 
GTGTTGAAAT GTTCCGTAAA CA 



2 ) INFORMATION FOR SEQ ID NO: 1203 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1203 
GGIGARMGIG G I AAYGARAT G 
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2 ) INFORMATION FOR SEQ ID NO: 1204 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1204 

15 

GC I AAYAAC I TC I WMYATGC C 21 



20 2 ) INFORMATION FOR SEQ ID NO: 1205 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

2 5 <C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1205 

AAYAC ITC I A WYATGCC I GT 20 



35 



45 



50 



2 ) INFORMATION FOR SEQ ID NO : 12 06 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 06 

CKISRIGTIG ARTCIGCCA 19 

2 ) INFORMATION FOR SEQ ID NO: 12 07 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



60 (ii) MOLECULE TYPE: DNA 

859 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1207 
CCITCITCWC CIGGCATYTC 



2 ) INFORMATION FOR SEQ ID NO: 1208 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 08 
TCAAAAAGTT TTCTAAAAAA TTTAC 



2) INFORMATION FOR SEQ ID NO: 1209 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1209 
ACGGGCGTCC ACAAAATCAA TAGGA 



2 ) INFORMATION FOR SEQ ID NO: 1210 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1210 
. ACCAGCTTGC C C AATAC AAA GG 



2 ) INFORMATION FOR SEQ ID NO: 1211 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1211 
ATTCTTGTAA CAGGCTTTGA TCCC 



2 ) INFORMATION FOR SEQ ID NO: 1212 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1212 
CCICCIRGIG GIGAIACIGC WCC 



2 ) INFORMATION FOR SEQ ID NO: 1213 

(i) SEQUENCE CHARACTERISTICS: 
—(-AO LENGTH-: — 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1213 
AARGGIGGIA CIGCIGCIAT HCCIGG 



2 ) INFORMATION FOR SEQ ID NO: 1214 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1214 
GGTAAAACAG GTACCTCTAA CTA 



2 ) INFORMATION FOR SEQ ID NO: 1215 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1337 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 

(B) STRAIN: D471 

(C) ACCESSION NUMBER: X65717 

15 (xii) SEQUENCE DESCRIPTION: SEQ ID NO: 1215 

AACAAAATAA AAGAACTTAC CTATTTTCCA TCCAAAATGT TTAGCAATCA 50 

TCATCTGCAA GGCAACGTAT TGCATGGCAT TGATGTGATG AGCAACTAAT 100 

ATGTCATTAG AACGTTGCGT CAAACTAGCA TC TAAAT AAA GATCGAAATG 15 0 

20 CAGTTATCAA AAATGCAAGC TCCTATCGGC CCTTGTTTTA ATTATTACTC 2 00 

ACATTGCCTT AATGTATTTA CTTGCTTATT ATTAACTTTT TTG CTAAGTT 2 50 

AGTAGCGTCA GTTATTCATT GAAAGGACAT TATTATGAAA ATTCTTGTAA 3 00 

CAGGCTTTGA TCCCTTTGGC GGCGAAGCTA TTAATCCTGC CCTTGAAGCT 350 

ATCAAGAAAT TGCCAGCAAC CATTCATGGA GCAGAAATCA AATGTATTGA 400 

25 AGTTCCAACG GTTTTTCAAA AATCTGCCGA TGTGCTCCAG CAGCATATCG 450 

AAAGCTTTCA ACCTGATGCA GTCCTTTGTA TTGGGCAAGC TGGTGGCCGG 500 

ACTGGACTAA CGCCAGAACG CGTTGCCATT AATCAAGACG ATGCTCGCAT 550 

TCCTGATAAC GAAGGGAATC AGCCTATTGA TACACCTATT CGTGCAGATG 600 

GTAAAGCAGC TTATTTTTCA ACCTTGCCAA TCAAAGCGAT GGTTGCTGCC 650 

30 ATTCATCAGG CTGGGCTTCC TGCTTCTGTT TCTAATACAG CTGGTACCTT 7 00 

TGTTTGCAAT CATTTGATGT ATCAAGCCCT TTACTTAGTG GATAAATATT 750 

GTCCAAATGC CAAAGCTGGG TTTATGCATA TTCCCTTTAT GATGGAACAG 800 

GTTGTTGATA AACCTAATAC AGCTGCCATG AACCTCGATG ATATTACAAG 850 

AGGAATTGAG GCTGCTATTT TTGCCATTGT CGATTTCAAA GATCGTTCCG 900 

35 ATTTAAAACG TGTAGGGGGC GCTACTCACT GACTGTGACG C T ACT AAAC C 950 

TATTTTAAAA AAACAGAGAT ATGAACTAAC TCTGTTTTTT TTGTGCTAAA 1000 

AATGAAAGAC CTAGGGAAAC TTTTCATCGG TCTTTCTCAA TTGTCATCTT 1050 

AATCTAATAC TACTTCTAAC ATCAGCGGGT ATAGTTTGCC AGTAATTAAG 1100 

AAACGTTGTT GATCTAAATG AGCAATCCCA TTCAAAACAT TAAGGTCAGG 115 0 

40 GTAATGGGAC TTATCAAGAT TTAAGGCTTT T AAC AAAGG A CTAATATCAT 1200 

AGGTGGCTAC CACCTTTCCA GAATCAGGTT GGAGTTTGAC AATAGTATTG 1250 

GTTTGCCAAA TATTGGCATA GAGATAACCA TC T AC AT ACT CTAATTCGTT 13 00 

AAGCATTGAG ATAGGGACAC TTTCTATAGC AACTAGT 1337 

45 

2 ) INFORMATION FOR SEQ ID NO: 1216 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1216 
GGTAAGACTG GTACATCAAA CTA 23 

60 
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10 



20 



2 ) INFORMATION FOR SEQ ID NO: 1217 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1217 
15 CAAATGCCAT TTCAAGTAAC ACAAC 25 



2 ) INFORMATION FOR SEQ ID NO: 1218 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2 5 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1218 

30 

CAAACGCCAT TTCAAGTAAC ACAAC 25 



35 2 ) INFORMATION FOR SEQ ID NO: 1219 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1219 

CAAATGCTAT TTCAAGTAAT ACAAC 



50 

2 ) INFORMATION FOR SEQ ID NO: 1220 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1220 
CAAACGC CATTTCAAGT AATACGAC 2 5 

5 

2) INFORMATION FOR SEQ ID NO: 1221 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1221 
GAYACICCIG GICAYGTIGA YTT 23 

20 

2) INFORMATION FOR SEQ ID NO: 1222 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1222 
35 ATYGAYACIC CIGGICAYGT IGAYTT 2 6 



40 



2 ) INFORMATION FOR SEQ ID NO: 122 3 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1223 
AYITCIARRT GIARYTCRCC CATICC 26 



55 2) INFORMATION FOR SEQ ID NO: 1224 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1224 
CCIGYIHTIY TIGARCCIAT IATG 



2) INFORMATION FOR SEQ ID NO : 1225 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1225 
TAICCRAACA TYTCISMIAR IGGIAC 



2) INFORMATION FOR SEQ ID NO: 122 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1226 
GTIRMRTAIC CRAACATYTC 



2 ) INFORMATION FOR SEQ ID NO : 1227 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1227 
GTICCIYTIK CIGARATGTT YGGITA 



2 ) INFORMATION FOR SEQ ID NO: 1228 
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10 



15 



25 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1228 
GTICCIYTIK C I GARATGTT YGGITAYGC 29 

2 ) INFORMATION FOR SEQ ID NO: 1229 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1229 
TCCATYTGIG CIGCICCIGT IATCAT 2 6 



2 ) INFORMATION FOR SEQ ID NO : 1230 

(_i )_ -SEQUENCE CHARAGTERrl-STICS : 

(A) LENGTH: 2145 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
3 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Escherichia coli 

(D) ACCESSION NUMBER: X00415 

(xiii) SEQUENCE DESCRIPTION: SEQ ID NO: 123 0 

45 TGAACGCCTA AAAGATAAAC GAGGAAACAA ATGGCTCGTA CAACACCCAT 5 0 

CGCACGCTAC CGTAACATCG GTATCAGTGC GCACATCGAC GCCGGTAAAA 100 

CCACTACTAC CGAACGTATT CTGTTCTACA CCGGTGTAAA CCATAAAATC 150 

GGTGAAGTTC ATGACGGCGC TGCAACCATG GACTGGATGG AGCAGGAGCA 200 

GGAACGTGGT ATTAC CATC A CTTCCGCTGC GACTACTGCA TTCTGGTCTG 2 50 

50 GTATGGCTAA GCAGTATGAG CCGCATCGCA TCAACATCAT CGACACCCCG 3 00 

GGGCACGTTG ACTTCACAAT CGAAGTAGAA CGTTCCATGC GTGTTCTCGA 350 

TGGTGCGGTA ATGGTTTACT GCGCAGTTGG TGGTGTTCAG CCGCAGTCTG 400 

AAACCGTATG GCGTCAGGCA AACAAATATA AAGTTCCGCG CATTGCGTTC 450 

GTTAACAAAA TGGACCGCAT GGGTGCGAAC TTCCTGAAAG TTGTTAACCA 500 

55 GATCAAAACC CGTC TGGGCG CGAACCCGGT TCCGCTGCAG CTGGCGATTG 55 0 

GTGCTGAAGA ACATTTCACC GGTGTTGTTG ACCTGGTGAA AATGAAAGCT 600 

ATCAACTGGA ACGACGCTGA CCAGGGCGTA ACCTTCGAAT ACGAAGATAT 65 0 

CCCGGCAGAC ATGGTTGAAC TGGCTAACGA ATGGCACCAG AACCTGATCG 700 

AATCCGCAGC TGAAGCTTCT GAAGAGCTGA TGGAAAAATA CCTGGGTGGT 750 

60 GAAGAACTGA CTGAAGCAGA AATCAAAGGT GCTCTGCGTC AGCGCGTTCT 800 
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GAACAACGAA 
GTGTTCAGGC 
GACGTACCTG 
TGAACGTCAC 
5 TCGCTACCGA 
GGTGTGG TTA 
TGAGCGTTTC 
TCAAAGAAGT 
GTAACCACTG 
10 ACGTATGGAA 
CCAAAGCTGA 
GAAGACCCGT 
CATCGCGGGT 
AGCGTGAATT 
15 CGTGAAACTA 
ACAGTCTGGT 
CGCTGGAGCC 
AAAGGTGGTG 
CCAGGAACAG 
2 0 TGGGTATTCG 
CTGGCGTTTA 
AGCGAAACCA 
CGGAAGAGAA 
ATGCTCAAAG 
2 5 AGTACCGCTG 
CCAAAGGTCG 
CCGAGTAACG 



ATCATCCTGG 

GATGCTGGAT 

CGATCAACGG 

GCAAGTGATG 

CCCGTTTGTT 

AC TCTGGTG A 

GGTCGTATCG 

TCGCGCGGGC 

GTGACACCCT 

TTCCCTGAGC 

CCAGGAAAAA 

CTTTCCGTGT 

ATGGGCGAAC 

CAACGTTGAA 

TCCGCCAGAA 

GGTCGTGGTC 

GGGTTCAAAC 

TAATCCCTGG 

CTGAAAG C AG 

TCTGCACTTC 

AACTGGCTGC 

GTTCTGCTTG 

C AC CGGTG AC 

GTCAGGAATC 

TCTGAAATGT 

TGCATCATAC 

TTGCTCAGGC 



TAACCTGTGG 
GCGGTAATTG 
TATCCTGGAC 
ACGAGCCGTT 
GGTAACCTGA 
TACCGTACTG 
TTCAGATGCA 
GACATCGCTG 
GTGTGACCCG 
CGGTAATCTC 
ATGGGTCTGG 
ATGGACTGAC 
TGCACCTCGA 
GCGAACGTAG 
AGTTACCGAT 
AGTATGGTCA 
CCGAAAGGCT 
CGAATACATC 
GTCCGCTGGC 
GGTTCTTACC 
TTCTATCGCC 
AGCCGATCAT 
GTTATCGGTG 
TGAAGTT AC T 
TCGGATACGC 
ACTATGGAAT 
CGTAATTGAA 



TTCTGCGTTC 

ATTACCTGCC 

GACGGTAAAG 

CTCTGCACTG 

CCTTCTTCCG 

AACTCCGTGA 

CGCTAACAAA 

CTGCTATCGG 

GATGCGCCGA 

CATCGCAGTT 

CTCTGGGCCG 

GAAGAATCTA 

CATCATCGTT 

GTAAAC CGCA 

GTTGAAGGTA 

TGTTGTTATC 

ACGAGTTCAT 

CCGGCCGTTG 

AGGCTACCCG 

ATGACGTTGA 

TTTAAAGAAG 

GAAGGTTGAA 

ACTTGAGCCG 

GGCGTTAAGA 

AACTCAGCTG 

TCCTGAAGTA 

GCCCGTGGTA 



AAGAACAAAG 850 

ATCCCCGGTT 900 

ACACTCCGGC 950 

GCGTTCAAAA 1000 

TGTTTACTCC 1050 

AAGCTGCACG 1100 

CGTGAAGAGA 1150 

TCTGAAAGAC 1200 

TCATTCTGGA 1250 

GAACCGAAAA 1300 

TCTGGCTAAA 1350 

ACCAGACCAT 1400 

G AC CGTATGA 1450 

GGTTGCTTAC 1500 

AACACGCGAA 1550 

GACATGTACC 1600 

CAACGACATT 1650 

ATAAAGGTAT 1700 

GTAGTAGACA 1750 

CTCCTCTGAA 1800 

GCTTTAAGAA 185 0 

GTAGAAACTC 1900 

TCGTCGTGGT 1950 

TCCACGCTGA 2 000 

CGTTCTCTGA 2050 

TGATGAAGCG 2100 

AATAA 2145 



30 



35 



40 



45 



50 



55 



60 



2) INFORMATION FOR SEQ ID NO : 12 31 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1231 
GCGAGCCCGA AGATAAAAAA GAACCTCTGC TGCTCGC 

2 ) INFORMATION FOR SEQ ID NO: 1232 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1232 
GGAGCCGCGC GATTTTATAA ATGAATGTTG ATAACCGGCT CC 

867 
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20 



2 ) INFORMATION FOR SEQ ID NO: 1233 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1233 
15 GCGAGCGTTA CTGGTGTAGA AATGTTCCGG CTCGC 3 5 



2 ) INFORMATION FOR SEQ ID NO: 1234 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

30 

ACTAAATAAA CGCTCATTCG 2 0 



35 2 ) INFORMATION FOR SEQ ID NO: 1235 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

• (ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 123 5 

GCGAGCCGAA GTTGAAGTTG TTGGTATTGC TGGCTCGC 38 



50 



60 



2 ) INFORMATION FOR SEQ ID NO: 1236 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 34 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

868 



CA 02307010 2000-05-19 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1236 
GCGAGCCGTG GTGAAGTTCG CGTTGGTGGC TCGC 



2 ) INFORMATION FOR SEQ ID NO: 1237 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1237 
GCGAGCCGCG AAATCGAAGT TGCTGTATTA GGGCTCGC 



2} INFORMATION FOR SEQ ID NO: 12 38 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 3 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1238 
GCGAGCGGCG TTAATTTTGG CACCGAAGAA GAGCTCGC 



2 ) INFORMATION FOR SEQ ID NO: 1239 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 33 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1239 
GCGAGCGCAG ACCTTTCAGC AGAGGAGGCT CGC 



2 ) INFORMATION FOR SEQ ID NO : 1240 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY : Linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1240 
GCGAGCCGGC AAGACAATAT GACAGCAAAA TCGCTCGC 3 8 

2 ) INFORMATION FOR SEQ ID NO: 1241 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 35 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1241 
GCGAGCGGGG AACGAGGATG ATTTGATTGG CTCGC 3 5 

2) INFORMATION FOR SEQ ID NO: 1242 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 600 bases 
(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

3 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(E) STRAIN: BM4147-1 

40 (F) ACCESSION NUMBER: U39790 

(xiv) SEQUENCE DESCRIPTION: SEQ ID NO: 1242 

TTCTTAGAGA CATTGAATAT GCCTTATGTC GGCGCAGGCG TATTGACCAG 50 

45 TGCATGTGCC ATGGATAAAA TCATGACCAA GTATATTTTA CAAGCTGCTG 100 

GTGTGCCGCA AGTTCCTTAT GTACCAGTAC TTAAGAATCA ATGGAAAGAA 150 

AATC CTAAAA AAGTATTTGA TCAATGTGAA GGTTCTTTGC TTTATCC G AT 200 

GTTTGTCAAA CCTGCGAATA TGGGTTCTAG TGTCGGCATT ACAAAGGCAG 25 0 

AAAACCGAGA AGAGCTGCAA AATGC TTT AG CAACAGCCTA TCAGTATGAT 300 

50 TCTCGAGCAA TCGTTGAACA AGGAATTGAA GCGCGCGAAA TCGAAGTTGC 350 

TGTATTAGGA AATGAAGATG TTCGGACGAC TTTGCCTGGC GAAGTCGTAA 400 

AAGACGTAGC ATTCTATGAT TATGAAGCCA AATATATCAA TAATAAAATC 450 

GAAATGCAGA TTCCAGCCGA AGTGCCGGAA GAAGTTTATC AAAAAGCGCA 500 

AGAGTACGCG AAGTTAGCTT ACACGATGTT AGGTGGAAGC GGATTGAGCC 550 

55 GGTGCGATTT CTTTTTGACA AATAAAAATG AATTATTCCT GAATGAATTA 60 0 



60 



2 ) INFORMATION FOR SEQ ID NO: 1243 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 
(C) ACCESSION NUMBER: M38386 

(xv) SEQUENCE DESCRIPTION: SEQ ID NO: 1243 

GGTACCAAAG AAAAAAACGA ACGCCACAAC CAACAGCCTC TAAAGCAACA 50 

CCTGCTTCTG AAATTGAGGG AGATTTAGCA AATGTCAATG AGATTCTTTT 100 

GGTTCACGAT GATCGTGTCG GGTCAGCAAC GATGGGAATG AAAGTCTTAG 150 

AAGAAATTTT AGATAAAGAG AAAATTTCAA TGCCGATTCG AAAAATTAAT 200 

ATTAATGAAT TAACTCAACA AACACAGGCT TTAATTGTCA CAAAAGCTGA 250 

ACTAACGGAA CAAGCACGTA AAAAAGCACC GAAAGCGACA CACTTATCAG 300 

TAAAAAGTTA TGGTTAATCC CCAAAAATAT GAAACAGTGG GTTTCGCTCT 350 

TAAAAGAAAG TGCCTAGAGA GGAAGAAAAC AATGGAAAAT CTTACGAATA 400 

TTTCAATTGA ATTAAATCAA CAGTTTAATA CAAAAGAAGA AGCTATTCGC 450 

TTTTCCGGCC AGAAACTAGT CGAGGCAGGC TGTGTTGAGC CCGCTTATAT 500 

CGAAGCAATG ATTGAAAGAG ACCAATTGCT ATCTGCCCAT ATGGGGAATT 55 0 

TTATTGCCAT TCCTCATGGA ACAGAAGAAG CCAAAAAATT AGTGAAAAAA 600 

TCAGGAATCT GTGTAGTGCA AGTCCCAGAG GGCGTTAATT TTGGCACCGA 65 0 

AGAAGATGAA AAAATTGCTA CCGTATTATT TGGGATTGCC GGAGTCGGTG 700 

AAGAACATTT GCAATTAGTC CAACAAATTG C AC TTTATTG TAGTGATATG 75 0 

GATAAC GTGG TGCAACTTGC CGATGCATTA AGTAAAGAAG AAATAACAGA 800 

AAATTTAGCC ATTGCTTAAA GGAGAGAATA AGAATGAACG CAGTACATTT 850 

TGGAGCAGGA AATATTGGAC GCGGCTTTAT TGGCGAAATT TTAGCTAAAA 900 

CGGGTTTCAT ATTACCGTTT GTGGATGTTA ATGGAAACCA TCATCAAGCG 95 0 

TTAAAAGAAC GTAAAAGTTA TACAATTGAA TTGGCCGATG CCTCACATCA 1000 

ACAAATTAAC GTTGAAAATG TGACCGGGTT AAATAACATG ACAGAACCAG 1050 

AAAAAGTAGT AGAAGCAATT GCGGAAGCCG ATTTAGTCAC GACGGCAATT 1100 

GGTCCTAATA TTTTACCAAG AATTGC TGAA TTAATTGCTC AAGGAATTGA 1150 

TGCACGTGCC GAAGCAAATT GTCAAAACGG CCCGCTGGAT ATTATCGCTT 12 00 

GTGAAAATAT GATTGGTGGT TCAACCTTTT TAGCAGAAGA AGTGGCCATA 1250 

ATATTTGAAA AACCCAGCTT ATCTGAACAA TGGATTGGTT TTCCTGATGC 1300 

GGCAGTTGAT CGGATTGTTC CATTACAAAA ACATAAAGAT CCACTTTTTG 13 50 

TTCAAGTTGA GCCTTTTTGT GAATGGGTCA TTGATGATAC CAACCGAAAA 1400 

GCCAAAGAGA TTCAGTTAGA AGGCGTCATT ACTTGTCGAT TAGAGCCGTA 1450 

TATTGAACGA AAATTATTTA GTGTAACCAG TGGCCATGCT ACAGTTGCCT 1500 

ATACAGGGGC GTTGTTAGGC TATCAAACCA TTGACGAAGC GATGCAGGAC 1550 

GCCTTAGTGG TAGCGCAACT CAAATCAGTT TTGCAGGAAA CCGGTAAACT 1600 

TTTAGTGGCC AAATGGAATT TTGATGAACA AGAACATGCA GCCTATATTG 1650 

AAAAAATTAT CAACCGTTTC CAAAATAAAT ATATTTCAGA TGCTATTACA 1700 

CGTGTAGCAC GGACACCAAT CAGAAAATTA GGTGCGCAAG AACGGTTTAT 1750 

TCGACCAATC CGTGAATTAC AGGAACGCAA TCTAGTGTCG GCCGCATTTA 1800 

T™™° AT TGGT ATTGTC TTTAATTATC ATGATCCAGA AGATGAACAA 1850 

AGCCGTCAAT TACAGGAAAT GCTTGACCAA GAAAGTGTTG ATACAGTGGA 1900 

TCGCTGAAGT AACGGGCATT GAAGATCCAG AAACGGTTAA AAATATTAAA 1950 

CAAAACGTAG AACTGCTATG CGCGACCACA AGTAGCATAA TTAACAAAAT 2000 

CCTTCTACCA AGATACTTCA CATTTCTTAA TTAAAGAAAA AACAACCGCG 2 050 

CCTCACCTGA GCCGACCCCC AAAAGTTAGA CCTAGAAATC TAACTTTTGG 2100 

AGGTTTTTTT GTATGGCAAA ATACAGTTTT GAAATTTAAA CTTAAACTTG 2150 

^ATGACTA CTTATATGGT CAAGGAGGTC TAAGGTTTCT CGCAAAGAAG 2200 

TATGGGTTTA AAGATAGTCT CAAATAAGCA AATGGATAAA TGCCTATAAA 2250 
GAACTTGGTG AAGAAGGGGG GATCC i. A1AAA 
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2 ) INFORMATION FOR SEQ ID NO: 1244 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 442 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Staphylococcus aureus subsp . aureus 

(B) STRAIN: ATCC 25923 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1244 

20 GATCAATCTT TGTCGGTACA CGATATTCTT CACGACTAAA TAAACGC TC A 50 

TTCGCGATTT TATAAATGAA TGTTGATAAC AATGTTGTAT TATCTACTGA 100 

AATCTCATTA CGTTGCATCG GAAACATTGT GTTCTGTATG TAAAAGCCGT 150 

CTTGATAATC TTTAGTAGTA CCGAAGCTGG TCATACGAGA GTTATATTTT 200 

CCAGCCAAAA CGATATTTTT ATAATCATTA CGTGAAAAAG GTTTCCCTTC 25 0 

2 5 ATTATCACAC AAATATTTTA GCTTTTCAGT TTCTATATCA ACTGTAGCTT 300 

CTTTATCCAT ACGTTGAATA ATTGTACGAT TC TGACGC AC CATC TTTTGC 350 

ACACCTTTAA TGTTATTTGT TTTAAAAGCA TGAATAAGTT TTTCAACACA 400 

ACGATGTGAA TCTTCTAAGA AGTCACCGTA AAATGAAGGA TC 442 

30 
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